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Abstract. In this article we develop convergence theory for a general class of adaptive 
approximation algorithms for abstract nonlinear operator equations on Banach spaces, 
and then use the theory to obtain convergence results for practical adaptive finite element 
methods (AFEM) applied to several classes of nonlinear elliptic equations. In the first 
part of the paper, we develop a weak-* convergence framework for nonlinear operators, 
whose Gateaux derivatives are locally Lipschitz and satisfy a local inf-sup condition. 
The framework can be viewed as extending the recent convergence results for linear 
problems of Morin, Siebert and Veeser to a general nonlinear setting. We formulate an 
abstract adaptive approximation algorithm for nonlinear operator equations in Banach 
spaces with local structure. The weak-* convergence framework is then applied to this 
class of abstract locally adaptive algorithms, giving a general convergence result. The 
convergence result is then applied to a standard AFEM algorithm in the case of sev- 
eral semilinear and quasi-linear scalar elliptic equations and elliptic systems, including: 
a semilinear problem with subcritical nonlinearity, the steady Navier-Stokes equations, 
and a quasilinear problem with nonlinear diffusion. This yields several new AFEM 
convergence results for these nonlinear problems. In the second part of the paper we 
develop a second abstract convergence framework based on strong contraction, extend- 
ing the recent contraction results for linear problems of Gascon, Kreuzer, Nochetto, and 
Siebert and of Mekchay and Nochetto to abstract nonlinear problems. We then establish 
conditions under which it is possible to apply the contraction framework to the abstract 
adaptive algorithm defined earlier, giving a contraction result for AFEM-type algorithms 
applied to nonlinear problems. The contraction result is then applied to a standard AFEM 
algorithm in the case of several semilinear scalar elliptic equations, including: a semi- 
linear problem with subcritical nonlinearity, the Poisson-Boltzmann equation, and the 
Hamiltonian constraint in general relativity, yielding AFEM contraction results in each 
case. 
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1. Introduction 

Due to the pioneering work of Babuska and Rheinboldt (5), adaptive finite element 
methods (AFEM) based on a posteriori error estimators become standard tools in solving 
PDEs arising in scientific and engineering. A standard adaptive algorithm has the general 
iterative structure: 

Solve — )■ Estimate — )■ Mark — )■ Refine (1.1) 

where Solve computes the discrete solution in a subspace C X; Estimate com- 
putes certain error estimators based on Uk, which are reliable and efficient in the sense 
that they are good approximation of the true error u — in the energy norm; IVIarl^ ap- 
plies certain marking strategies based on the estimators; and finally, Refine divides each 
marked element and completes the mesh to to obtain a new partition, and subsequently 
an enriched subspace X^+i. The fundamental problem with the adaptive procedure (jl.ll) 
is guaranteeing convergence of the solution sequence. For a posteriori error analysis, we 
refer to the books |l2l[69l|59l and the references cited therein. 

The first convergence result for (11.11) was obtained by Babuska and Vogelius [6] for 
linear elliptic problems in one space dimension. The multi-dimensional case was open 
until Dorfler [|26l proved convergence of (|l.ll) for Poisson equation, under the assump- 
tion that the initial mesh was fine enough to resolve the influence of data oscillation. This 
result was improved by Morin, Nochetto, and Siebert [i46il . in which the convergence was 
proved without conditions on the initial mesh, but requiring the so-called interior node 
property, together with an additional marking step driven by data oscillation. Since these 
seminal papers, a number of substantial steps have been taken to generalize these con- 
vergence results for linear elliptic problems in various directions. Of particular interest 
to us here are the following. In ||49l l47l [61 il the asymptotic convergence results were 
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obtained for a general class adaptive methods for a large class of linear problems. The 
theory does not require marking due to oscillation, or the interior node property, and al- 
lows more general marking strategies than what had been used in Dorfler's arguments, 
with different a posteriori error estimators. In another direction, it was showed by Binev, 
Dahmen and DeVore [fTTI for the first time that AFEM for Poisson equation in the plane 
has optimal computational complexity by using a critical coarsening step. This result 
was improved by Stevenson [|62l by showing the optimal complexity in general spatial 
dimension without coarsening step. These error reduction and optimal complexity results 
were improved recently in several aspects in [[TTII . In the analysis of [|T71 , the artificial 
assumptions of interior node and extra marking due to data oscillation were removed, 
and the convergence result is applicable to general linear elliptic equations. The main 
ingredients of this new convergence analysis are the global upper bound on the error give 
by the a posteriori estimator, orthogonality (or possibly only quasi-orthogonality) of the 
underlying bilinear form arising from the linear problem, and a type of error indicator 
reduction produced by each step of AFEM. We refer to [50J for a recent survey of con- 
vergence analysis of AFEM for linear elliptic PDEs which gives an overview of all of 
these results through late 2009. 

There are a number of recent and not- so-recent articles concerning a posteriori error 
analysis for nonlinear partial differential equations; cf. [[8l|68ll55l|9l|5illOl|57l[39l[59l 
[T8l . However, to date there have been only a handful of AFEM convergence results for 
nonlinear problems. Some of the results are: AFEM convergence for a scalar problem 
involving the p-Laplacian was shown in [[66ll25l : AFEM convergence for a class of con- 
vex nonlinear problems arising in elasticity in ifTSl [T4l ; and AFEM convergence for the 
nonlinear Poisson-Boltzmann equation in [|20ll . These results typically involve problem- 
specific handling of the nonlinearity. A recent article in a more general direction is the 
paper of Ortner and Praetorius [52] where the convergence analysis of an adaptive algo- 
rithm for a large class of nonlinear equations is discussed based on energy minimization, 
including the cases lacking an Euler-Lagrange equation due to low differentiability prop- 
erties of the energy. However, their argument is tailored specifically for non-conforming 
finite element methods, with some remaining obstacles for the conforming case. 

In this article we develop convergence theory for a general class of adaptive approxi- 
mation algorithms for abstract nonlinear operator equations on Banach spaces, and then 
use the theory to obtain convergence results for practical adaptive finite element meth- 
ods (AFEM) applied to several classes of nonlinear elliptic equations. In the first part of 
the paper, we develop a weak-* convergence framework for nonlinear operators, whose 
Gateaux derivatives are locally Lipschitz and satisfy a local inf-sup condition. The frame- 
work can be viewed as extending the recent convergence results for linear problems of 
Morin, Siebert and Veeser [|49ll47l[6n to a general nonlinear setting. We formulate an 
abstract adaptive approximation algorithm for nonlinear operator equations in Banach 
spaces with local structure. The weak-* convergence framework is then applied to this 
class of abstract locally adaptive algorithms, giving a general convergence result. The 
convergence result is then applied to a standard AFEM algorithm in the case of sev- 
eral semilinear and quasi-linear scalar elliptic equations and elliptic systems, including 
a semilinear problem with polynomial nonlinearity, the steady Navier-Stokes equations, 
and a more general quasilinear problem. This yields several new AFEM convergence 
results for these nonlinear problems. 

A disadvantage of the weak-* convergence framework is that it does not give informa- 
tion on adaptive finite element convergence rate; strict error contraction results are key 
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to complexity analysis of specific instances of the AFEM algorithms. To allow for com- 
plexity results of this type, in the second part of the paper we develop a second abstract 
convergence framework based on strong contraction, extending the recent contraction re- 
sults for linear problems of Gascon, Kreuzer, Nochetto, and Siebert [17J and of Mekchay 
and Nochetto P31 to abstract nonlinear problems. We then establish conditions under 
which it is possible to apply the contraction framework to the abstract adaptive algorithm 
defined earlier, giving a contraction result for AFEM-type algorithms applied to nonlin- 
ear problems. The contraction result is then applied to a standard AFEM algorithm in 
the case of several semilinear scalar elliptic equations, including a semilinear problem 
with polynomial nonlinearity, the Poisson-Boltzmann equation [|3T1l and the Hamiltonian 
constraint [|35ll in general relativity, yielding AFEM contraction results in each case. 

The remainder of this paper is organized as follows. In Section [21 we develop an ab- 
stract framework for ensuring that a sequence of Petrov-Galerkin (PG) approximations to 
the nonlinear problem converges to the solution of a nonlinear equation, by ensuring the 
weak-* convergence to zero of the sequence of corresponding nonlinear residuals. This 
involves first establishing a priori estimates and a general convergence result in Sec- 
tion 12. 1[ together with recalling some (mostly standard) a posteriori error estimates in 
Section [Z2l In Section[3l we present a class of abstract adaptive algorithms which (under 
reasonable assumptions) fit into both the weak-* convergence framework developed in 
Section|2]and the contraction framework developed in Section[6l The class of algorithms 
is general enough to include both classical adaptive finite element methods (AFEM) for 
two- and three-dimensional elliptic systems, as well as AFEM algorithms for geometric 
elliptic PDF on Riemannian manifolds (cf. Il35l [30ll ). In Section IH we give the main 
convergence results for the class of adaptive algorithms described in Section |3l In partic- 
ular, we prove that the adaptive algorithm generates a sequence of approximate solutions 
which converge strongly to the solution, by showing that the corresponding sequence of 
nonlinear residuals weak-* converges to zero. We present a sequence of examples in Sec- 
tion|5]to illustrate the weak-* convergence framework. In Section[6l we outline a second 
distinct abstract framework for ensuring that a sequence of approximations to the nonlin- 
ear problem produced by an adaptive algorithm converges to the solution of a nonlinear 
equation, by ensuring strict contraction of the quasi-error (the sum of the error norm 
and the error indicator). This framework is based on establishing strengthened Gauchy 
and quasi-orthogonality-type inequalities for successive PG approximations produced by 
adaptive algorithms in Sections 16.21 - 16.31 together with a general abstract contraction re- 
sult derived in Section 16.41 The contraction result is an abstraction of the contraction 
arguments used in [|45l [T6l [351 [3T1l , suitable for use with approximation techniques for 
nonlinear problems. As in these existing frameworks, it is based on establishing: (1) 
quasi-orthogonality; (2) error indicator bound on the error; (3) a type of indicator reduc- 
tion. We prove that under these assumptions, the adaptive algorithm generates a sequence 
of approximate solutions for which the quasi-error strictly contracts. Finally, we present 
several examples of increasing difficulty in Section [7] to illustrate this framework. 

2. An Abstract Weak* Gonvergence Framework 

In this section, we focus on developing a general convergence framework for abstract 
nonlinear equations. To explain the problem class, the adaptive approximation algorithm, 
and the set of convergence results we wish to establish, let X and Y be real Banach 
spaces (complete normed vector spaces over the field R) with norms || ■ \\x and || ■ 
respectively. Denote the topological dual spaces of bounded linear functionals on X and 
Y as X* and Y* respectively. In this paper, we are interested in the convergence of a 
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general class of adaptive algorithms for solving the nonlinear equation: 

Find ue X, such that F{u) = 0, 

or in a weak form: 

Find M G X, such that {F{u),v) = 0, Vt; G Y, (2.1) 

based on placing some minimal conditions on the first (Gateaux or Frechet) derivative 
of F. We note that (|2.1I) often itself arises naturally through Gateaux differentiation of 
a scalar-valued energy J : X — ^ R, as the Euler Condition for stationarity of J{u), 
although we will consider the general case here whereby there may not be an underlying 
energy functional. In any case, recall (cf. [|651 [381 [5T| ) that the Gateaux variation of F at 
u G Xin the direction w E X is given as: 



F'{u)w = ^F{u + ew) 
de 



(2.2) 

and recall that when they exist as bounded linear operators, the Gateaux and Frechet 



derivatives at u in the direction w agree with F'{u) above, uniquely generated by (|2.2I) . 
Note that in general, the solution to equation (12.11) may not be unique. In this paper, we 
are interested in the locally unique solution, which is unique in a neighborhood: 

Definition 2.1. We say u E X is a locally unique solution to (12.11 ) in a neighborhood 
U cXofu,ifuis the only solution of (12.11 ) in U. 

Our aim now is to show that: For any convergent sequence {u^} in X, if the residuals 
F{uk) of the nonlinear equation (12.11 ) weak-* converge to zero, then the sequence con- 
verges to the solution of (12.11 ). Based on this abstract convergence result, the remainder 
of this section will be devoted to establishing existence, a priori error estimates, and a 
posteriori error estimates, for Petrov-Galerkin approximations to equation (12.11) . 

The following simple theorem will form the basis for our convergence analysis. 

Theorem 2.2. For a continuous (nonlinear) map F : X Y* , suppose that u E X is 
a locally unique solution to (12.11) in a neighborhood U (IXofu. Let {uk} G U be a 
sequence converging to some G U, such that 

lim {F{uk),v) = 0, Vf G Y. (2.3) 

k—^oo 

Then we have = u. 
Proof. We have 

{F{u,),v) = {F{u,) - F{uk),v) + {F{uk),v) 

^ \\F{u,) - F{uk)\\Y*\\v\\Y + I {F{uk),v) |. 
The conclusion follows by the continuity of F, (12.31) and uniqueness of uinU. □ 

One of our central goals in the paper is now to develop a practical way to generate the 
sequence {uk} satisfying the conditions in Theorem 12.21 To this end, we introduce two 
sequences of nested (finite-dimensional) subspaces 

Xo C Xi C . . . C X and Fo C Fi C . . . C y, 

where dim(Xfc) = dim(Yfc) for each A; G N. In addition, we introduce the spaces 

(Xqo, ^00) . 



X^ = \JX, , and Y^ = [jY, 



M. HOLST, G. TSOGTGEREL, AND Y. ZHU 



We focus on a class of approximation methods whereby the sequence of approximations 
{uk e Xk} C X to the exact solution u E X to (12.11) are generated by solving the 
Petrov-Galerkin (PG) problems 



Find Uk e Xk, such that {F(uk), Vk) = 0, Wv^. e 



(2.4) 



We next consider conditions on F to establish well-posedness of (12.41) . and derive a 
priori error estimates for the approximations Uk ^ u. 

2.1. A Priori Error Estimates. Let G be a mapping from X ^ Y*, understood as 
an approximation of F. Assume G satisfies the following conditions: 

(HI) There exists a constant 5 > such that G' satisfies 

||G"('u) — G'{x)\\^xY*) ^ -^11"" ~ ^\\x, Vx G X with \\u — x\\x ^ S. 

(H2) G'{u) is an isomorphism from X ^ Y*, and there exists a constant M > such 
that 



G''\u] 



< M. 



C{Y'-,X) 

(H3) ||G(w)||y. ^C, where C = min{^,^}. 

Assumptions (H2) and (H3) are stability and consistency conditions, respectively. If G 
satisfies (H1)-(H3), then we have the following lemma, similar to [i55t Theorem 2]. 

Lemma 2.3. Let G satisfy the assumptions (HI )-(H3), then there exist a constant 5o > 
and a unique uq ^ X such that G{uc) = 0, and \\u — ug\\x ^ ^o- Moreover, we have 
the following a priori error estimate: 

\\u-UG\\x^2\\G'{u)-'\\^^^,^^^\\G{u)\\y.. 
Proof. We show existence and uniqueness by fixed-point argument. Define first 

T{x)=x-G'{u)-^G{x), VxgX. 

This new operator T is well-defined because G'{u) is an isomorphism by Assumption 
(H2). Then for any xi, X2 G X we have 



|T(xi)-T(x2)||x 



(xi - X2) + G{u)'-\G{x2) - G'(xi)) 



[xi - X2) - G{u) 



/-I 



G'{xi + t{x2 — Xi))(Xi — X2)dt 



X 



G\u)-^ / {G\u)-G\xi+t{x2-xi))){xi-X2)dt 



Let 5o > such that 5q = min{(5, ^jj^}- We try to show that T is a contraction mapping 
in the ball B{u, 5q) C X. By Assumption (HI), we have 

\\G\u) - G'{{xi +t{x2 - Xi))\\c(X,Y') ^ -^^0, Vxi,X2 G B{u,6o). 

Therefore, by the choice of 5q and (H2) we have 

||T(xi) - T(x2)|U ^ L6o\\G'{uy^\\^^y,^^^\\xi - X2\\x - ^2\\x- 



CONVERGENCE OF ADAPTIVE METHODS FOR NONLINEAR PDF 7 

In addition, by using the above inequality and Assumption (H3), for any x E B{u, 5o) 
we have 

\\T{x)-u\\x ^ \\T{x)-T{u)\\x + \\T{u)-u\\x 
^ - mIIx + ||G"(m)""^G('u)||^ 

Therefore, T is a contraction mapping from B{u,5q) to B{u,5q). Thus, there exists a 
unique uq G B{u, 5q) such that uq = T{ug), that is, G{ug) = 0. Moreover, 

\\u - ugWx = \\u - T{ug)\\x ^ 2 \\G\u)-^\\^^y,^^^ 

which completes the proof. □ 

Lemma 12.31 provides us with an abstract framework for existence, uniqueness, and 
the a priori error estimate (giving continuous dependence) for the approximated scheme 
G{x) = 0. Based on this lemma, we now try to construct such a nonlinear operator G 
for the Petrov-Galerkin formulation (12.41) . This turns out to be nontrivial, since Petrov- 
Galerkin formulations are built only on the subspaces {Xk,Yk}, whereas the operator 
G : X — 7- F* is defined on the pair {X, Y). Therefore, for each pair {Xk, Yk), we need 
to construct an operator : X ^ Y* such that the weak solution of Fk{x) = is 
equivalent to the solution of (12.41 ). 

To this end, let us first introduce a bilinear form b : X x Y ^ M at u G X: 

b{x,y) = {F'{u)x,y), Wx e X, \/y e Y, (2.5) 
which is the linearization of F at m. Denote by the norm of b: 

\\b\\ := snp{b{x,y) : x e X, y eY s.t. \\x\\x = \\y\\Y = 1} = ||-^'(^^)||£(x,y*)- 
We assume "inf-sup" conditions hold for b, i.e., there exists a constant /3o > such that 
inf sup b{x, y) = inf sup b{x, y) = (3o > 0. (2.6a) 

X&X,\\x\\x = l y(zY,\\y\\Y = l J'ey,||j/||y = l x&X,\\x\\x = l 

This condition is equivalent to assuming that F'(u) is an isomorphism from X to Y* 
with 

\\F'{u)-'\\ciy.,X)=^o'- 

In the finite-dimensional spaces (X^, Y^), we assume that b satisfies a discrete inf-sup 
condition of the form 

inf sup b{x,y) = inf sup b{x,y) ^ f3i > 0. (2.6b) 

x&Xk,\\x\\x=i ?/eYfe,||i/!|y=i y&yk,\\y\\Y=i x€X^„\\x\\x=i 

Based on these inf-sup conditions, we have that b{-, ■) also satisfies the following inf-sup 
condition for the pair of spaces (Xqo, l^oo)- 

Lemma 2.4. Let the bilinear form &(■,■) satisfies the inf-sup condition (I2.6bl) on (X^, Yk) 
for k = 1,2, . . . . Then it satisfies the inf-sup condition on (Xqo, ^oo) '■ 

inf sup b{x, y) = inf sup b{x, y) ^ f3i > 0. (2.6c) 

xeXoo,\\x\\x='^ y&Yo^,\\y\\Y = l yeyoo,||j/||y = l xeXoo,\\x\\x = l 

Proof See ['49', Lemma 4.2]. □ 
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For each /c = 0,l,---,oo, inf-sup condition (|2.6bl) or (|2.6cl) implies existence of two 
projectors 

: X ^ Xfc and Ul :¥ ^ Y^, 

defined by 

b{x-U^x,yk) = yyteYk Vx G X, (2.7) 
6(xfc,|/-n^l/) = VxfeGXfe VyeF. (2.8) 

These operators are stable in the following sense: 

lin^llw,)^^ and ||nriU(y,y,) ^ ^. (2.9) 

In fact, take projector as an example, by the discrete inf-sup condition (|2.6bl ), we 
have 

/3i||n^x||x ^ sup b(Il^x,yk) 

yfeGift,||s/ft||i'=i 

= sup b{x,yk) 

yfcGYfe,||s/fc||y=l 

^ IIMIII^IIx. 

Moreover, the discrete inf-sup condition (I2.6bl) guarantees that 

(nf )2 = and (n^)^ = n^. 

Now we are ready to define the nonlinear operator Fk : X — > F* for A; = 0, 1, . . . , oo : 
{Fk{x),y) := {F{x),Uly) + b{x,y -Uly), WxeX,yeY. (2.10) 
By a direct calculation, we observe that 

{F;,ix)w,y) := {F'{x)w,Uly) + {F'{u)w,y - Uly) . (2.11) 

In particular, we have -F^(m) = F'{u). This operator Fk gives rise to another nonlinear 
problem: 

VmdweX, such that (Ffc(u7), = 0, My eY. (2.12) 

The equation (|2.12l) is posed on the whole spaces (X, Y). However, it is not difficult to 
verify that the solution to (12.41) and the zero of (|2.10l) are equivalent: 

Lemma 2.5 ( ||55l Lemma 1]). Uk G X^ is a solution of (12.41 ) if and only ifuk & X is a 
solution of (12.121) . 

Proof We include the proof here for completeness. If G X^ C X is a solution 
to (IZ41) . then 

{F{uk),Vk) = MvkEYk. 

Therefore, 

(FK),n^y) = o, VyGF. 

For the second term in (12.101 ), notice that Uk G X^, and by the definition of we have 

b{uk,y-nly)=Q, Vi/GF. 

Thus, Mfc G X is a solution to (12.121) . 

Conversely, let w G X satisfy {Fk{w),y) =0, \fy eY, that is 

{F{w),Uly) + b{w,y-Uly) = 0, VyeY 
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By choosing y = v — Il[v, we obtain b{w, v — H^f ) = 0, Vf G Y. By the definition 
of n|' and n^, we then have 

b{w - U^w, v) = b{w, V - Ulv) =0, e Y. 

Since the inf-sup condition holds for 6, we have w = H^w G X^. On the other hand, by 
choosing y = Vk ^ Yk, we then have 

{F{w),Vk) = 0, ^VkeYk, 

which implies that w G is a solution to (12.41) . □ 

Lemma 12.51 shows that (12.121 ) is actually a reformulation of (12.41 ), which posed in 
(Xfc, Yfc), into the whole spaces (X, Y). It enables us to obtain the well-posedness and a 
priori error estimate of (12.41) by applying Lemma [23] to Fk- More precisely, we have the 
following main result. 

Theorem 2.6. Suppose equation (12.11) and the discretization (12.41) satisfy the inf-sup con- 
ditions (|2.6a| ). (|2.6b|) respectively. Moreover, suppose that F' is Lipschitz continuous at 
u, that is, 

3(5 and L such that for all w E X, \\u — w\\x ^ S 
\\F'{u) - F'{w)\\cix,Y*) < L\\u - w\\x. 
If in addition the subspace Xq satisfies the approximation condition 



^WJ|.-,„||,<||(,|r'^^l + ^J mi„|^,^|, (2.13) 

then there exist a constant 6i > such that equation (|2.4I) has a locally unique solution 
Uk G Xfc in B{u, 6i) for any k ^ such that Xq C X^. Moreover, we have the a priori 
error estimates: 

\\u - UkWx ^ ^ f 1 + min \\u - Xk\\x. (2.14) 

Proof. By Lemma 12. 5[ a solution to equation (12.41) is equivalent to a solution to the 
equation (12.121) . By choosing G = in Lemma [23l we only need to verify Assumptions 
(H1)-(H3). 

Note that (12.1 II ) implies Fl^{u) = F'{u). Therefore, we have 

\\F'{u)-^\\co^.^x) = Po' 

from the inf-sup condition (|2.6a| ). The assumption (H2) follows. Again, by (12.1 II ) we 
deduce that for any w,x e X and y eY, 

{(Kiu) - F^{x))w, y) = {{F'{u) - F\x))w, lily). 

Therefore, 

\\F',{u) - ^ \\F\u) - F'{x)\\^^^^y,^ WkWciYX,) 

^ ^^LWu-xU. 
Pi 

where in the second inequality we used stability (12.91) of 11^. Hence, Fk satisfies (HI). 
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For Assumption (H3), we have 



l-P'feMlly* = sup {Fk{u),v) 

VdY, ||l'||y=l 

= sup h{u, V — n|'t>) 
= sup b{u — H^u, v) 

V&Y, ||tl||y=l 

^ \\h\\\\u-Yi^u\\x. 



By triangle inequality and stability (12.91 ) of 11^, we have 

\\u - nf ^ \\u - Xk\\x + l|nf (m - Xk)\\x ^ (^i + ) . Ik - Xk 

Therefore, we obtain 



Pi I Xk&^k 



\F,{u)\\y,^\\h\\[l+'-^\ inf \\u-Xk\\x. 



Notice that Xq C X^, and by assumption (|2.13l) we have 

\\Fkiu)\\y,^\\b\\(l+^^) inf ||w-XolU^min 

Hence, Assumption (H3) is satisfied. Therefore by Lemma [2Jl the there exists a constant 
^1 > such that equation (12.41) has a locally unique solution E X^ in B(u, Si) for any 
k ^ 0. Furthermore, we have the following a priori error estimate: 

\\u - u,\U K 2 \\F,{u)\\y^ ^ ^ + x^k " 

This completes the proof. □ 

Remark 2.7. Theorem \2.6\ is similar to [,55. Theorem 4]. However, instead of assuming 
the approximation property 

lim inf — a;fc||x = 

as used in their proof, we only assume that the initial subspace Xq satisfies (12.131) . This 
is important because in the adaptive setting, we cannot (and of course, do not want to) 
guarantee that /i — )■ uniformly. The assumption (12.131 ) is essentially the approximation 
property of the subspace Xq, since that 

inf \\u - xo\\x ^ \\u - Iqu\\x- 

In most of the applications we consider, the finite element space Xq has certain approx- 
imation property, i.e., \\u — Iqu\\x = O^Hq) for some a > 0, where Iq is inclusion 
or quasi-interpolation. Therefore, the condition (12.131 ) can be satisfied by choosing the 
meshsize Iiq of the initial triangulation to be sufficiently small. 

Based on Theorem 12. 6[ there exists a locally unique solution u^o G B{u, 5i) C X^o 
with the test space Y^. In the remainder of this section, we will show that the PG so- 
lution sequence {uk G Xk} converges to the solution Uoo G -^oo of (12.41) in (Xoo, Yoo)- 
Therefore, we indeed constructed a convergent sequence Uk -> u^o as /c — )• oo by the 
Petrov-Galerkin approximation. 

With this Moo, let us introduce another bilinear form 6oo(-, ■) : -^oo x Yoo — > IR as 

y), Vx G Xoo ; y G Foo, (2.15) 
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which is formed by linearizing F at € X^o- Comparing with (12.51) . we have 

sup b^{x,y) = sup {b{x,y) + b^{x,y) -b{x,y)) 

J/G^oo,||s/||i' = l y&Ya^,\\y\\Y=l 

> (3o- sup {{F'{uoo)-F'{u))x,y) 

yeyoo,||s/||y=i 

> /3o-\\F'{u)-F\u^)\\cix,Y*), VxgXoo, ||x|U = 1- 

Therefore, if F' satisfies the Lipschitz continuity condition for some 5 > as stated 
in Theorem 12.61 then we can choose a constant 5i > sufficiently small such that the 
following inf-sup condition holds in B(u, 6i) : 

inf sup boo{x,y) = inf sup boo{x,y) = f3o > 0. 

y&Yoo,\\y\\Y=i 2.gXoo,lklU=i xex^,\\x\\x=i j/gy^,||j;||y=i 

(2.16a) 

Similarly, we can show the discrete inf-sup condition holds in B(u, 5i) : 

inf sup boo{x,y) = inf sup boo{x,y) = /3i > 0. 

(2.16b) 

These inf-sup conditions imply that there exists stable projections 11^ and 11^ similar 
to (|2.7I) - (|2.8I) . Same as before, we can define a sequence of nonlinear equations: 

Find x e Xoo, such that {Fk{x),y) = 0, Vy G Y^, (2.17) 

where 

Fkix), y) = {Fix), Illy) + booix, y - Illy). 

Following the same lines of the proof of Lemma 12.51 one can show the solution to the 
nonlinear equation (12.171) is the solution to the PG problem (12.41) for each A; = 0, 1, ... . 

In the proof of Theorem 12.61 . if we replace (X, Y) by (Xqo, ^oo), u by Uoo and the 
inf-sup conditions (I2.6al) - (l2.6bl) by (I2.16al) - (l2.16bl) . then we have the following theorem. 

Theorem 2.8. Let the assumptions in Theorem \2.6\ be fulfilled. Then there exists a neigh- 
borhood B{u, 5i) ofu such that the equation (12.41) has a locally unique solution Uk G X^ 
for each k ^ 0. We also have the following a priori error estimate: 

\\uoo -UkWx ^ C inf \\uoo-Xk\\x, V/c = 0, 1, ■ • • . 

Xk^Xk 

Consequently, the PG sequence {u^} converges to Uoo, that is, lim Uk = in X. 

Proof. By the same argument as in Theorem 12.61 equation (12.41 ) has a locally unique 
solution Uk G Xfc for each k ^ 0. Furthermore, we have the quasi-optimal estimate: 

\\uoo - UkWx ^ C inf \\u^-Xk\\x, V/;; = 0, 1, ■ • • . 
By density of IJfcli in Xoo, we then have lim \\uk — Uoo\\x = 0- D 

k—^oo 

Remark 2.9. Theorem \2.8\ confirms that the approximate sequence {uk} has a limit 
Uoo £ Xoo- However, this u^o does not necessarily coincide with the exact solution u. 
Note that u^o = u if and only if the residual F{uoo) = 0. Obviously, this is the case when 
Xoo = X. However, in general adaptive settings, one has Xoo 7^ X. Nevertheless, by 
Theorem \2.2\ it suffices to verify the weak-* convergence: F{uk) 0. 
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2.2. A Posteriori Error Estimates. Given any approximation of n, the nonlinear 
residual F{uk) can be used to estimate the error ||m — Mfc||x7 through the use of a lin- 
earization theorem |l43l[68l- An example due to Verfiirth is the following. 

Theorem 2.10. Il68l Let u & X be a regular solution of (12.11) so that the Gateaux 
derivative F'{u) is a linear homeomorphism ofX onto Y*. Assume that F' is Lipschitz 
continuous at u, that is, 

36 and L such that for all w & X, \\u — w\\x ^ S 

\\F'{u) - F'{w)\\cix,Y*) < L\W - w\\x. 

Let R = mm{6, \\F' {u)^^\\ c{Y'- ,x) , '^L~^\\F'{u) \\c{x,y*)}- Then for all Uk E X such 
that \\u — Uk\\ < R, 

Ci||FK)||y. ^ ||n - UkWx ^ C2||FK)||y*, (2.18) 
where C, = il|F'(M)||^(x,y.) and C2 = 2\\F'iu)-'\\ciY^,x). 

Proof See [[61. □ □ 

The linearization is controlled by the choice of 5 sufficiently small, where 5 is the 
radius of an open ball in X about u. The strength of the nonlinearity is represented 
by the factors in (12.181 ) involving the linearization F'{u) and its inverse. To build an 
asymptotic estimate of the error, one focuses on two-sided estimates for the nonlinear 
residual ||F(ufc)||y. appearing on each side of (12.181) . 

3. A General Adaptive Algorithm 

The analysis in Section [2] reveals that under reasonable assumptions on the nonlinear 
operator F{-), the Petrov-Galerkin problem (12.41) is well-posed. Moreover, given the 
nested subspaces {Xk} and {1^}, the solution sequence {uk E Xk} converges to the 
exact solution u E X if the corresponding residual sequence {F{uk)} C Y* weak-* 
converges to zero, that is 

hm{F{uk),v) = 0, ^veY. (3.1) 

In this section, we show how to construct subspaces {X^, Y^j in an adaptive setting so 
as to ensure (13.11 ). In particular, based on a few assumptions on the algorithm, we show 
that the solution sequence generated by the algorithm produces a residual sequence that 
satisfies (13.11 ). 

3.1. The Setting: Banach Spaces with Local Structure. Since the algorithm to be 
analyzed is of a finite element type, we need to have as the spaces X and Y function 
spaces defined over a domain in M*^, or over a manifold. The manifold setting is more 
general because a domain is trivially a manifold; however, in order to avoid the necessary 
differential geometric language to also cover the case of geometric PDE on manifolds, 
we consider here the even more general setting of measure spaces, which allows for a 
simple and transparent discussion of the core ideas. (In ll36l[34ll . we consider specifically 
this geometric PDE setting.) 

Let (fi, S, fi) be a measure space, where is a set (a subset of R"' or a rf-manifold), 
S is a o"-algebra, and /i : S — t- [0, 00] is a measure. Recall that a o -algebra S C 2^ 
over is a partition (a collection of subsets or elements) of Vt which contains Vt, and 
is closed under the complement in Vt and countable union operations. Then a measure 
/i : E — )■ [0, 00] is a function with /i(0) = and additive under disjoint countable unions. 
We say that T is a partition (a set of subsets) of Vt with elements (simply connect subsets) 
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{rjreT if Urer ^ = ^ n = for any ti,T2 E T such that ti ^ T2. We introduce 
the meshsize function hj- associated to T as 



/ir(a;) = /i(r)d, \/xeTeT. 

Note that hj- is well defined up to a (/-dimensional Lebesgue measure zero skeleton. 
Thus, we can understand hj- E as a piecewise constant function. Given any 

subset iS C T, we denote ^l^ = IJre5 ^- ^i^) finite element subspaces 

defined on each element r E T. We denote X{^ls) := [J^^g X{t). For simplicity, let 
X(T) := X{^l'T-). We use similar notation for Y and Z below. 

We assume now that the Banach spaces X(T) and Y(T) associated with the partition 
T have certain local structures provided by the associated measure space (Vl, S, yu). In 
particular, we assume that the induced norms || ■ \\x and || ■ ||y are subadditive in the 
underlying domain: 

{lkllx(r)},g^||^^ - \\w\\x, wwEX{ry, 

(3.2a) 

{lkl|yM},,^||^^ ^ \\v\\y, yvEY{T). 

In addition, we assume that the norms are absolutely continuous with respect to the 
measure yu(-) in the sense that, for any w E X and v eY, there holds 

— ^ and ||f — )■ 0, as fi{Lu) — > 0. (3.2b) 

Furthermore, we assume that the abstract or generalized finite element spaces have the 
following local approximation property: Let Y C F be a dense subspace of Y; we 
assume that for any partition T, there exists an interpolation operator I-j- ■ Y ^ Y{T) 
such that for all t> E Y, 

\\V - lTv\\Y(r) < ||/iflloo,r||t^||y(,), Vt E T, (3.2c) 

where s > is a constant. 

The two most relevant examples of such Banach spaces with this type of local struc- 
ture are subspaces X C L^(fi), where is either a bounded open subset of M", or where 
f2 is a Riemannian iy*'''-manifold (a differentiable manifold with metric in W^''^), and 
where T is a partition of ^7 into elements r. Such subspaces then include Sobolev spaces 
of scalar and vector functions over domains and partitions in M" (cf. [|42l [TJ), as well 
as Sobolev spaces of W^'^'P-sections of vector bundles over f2 and partition elements r 
(see [|53l|29l[32l for a discussion of these spaces). See [[30l|33ll35l and Section IT41 for 
examples in the case of manifold domains. We note that to show (I3.2cl) holds in specific 
cases, it is not enough to assume that h^-ix) is sufficiently small, but also that certain 
geometric (e.g. geodesic angle) conditions hold for the elements {r}. In this article, 
we assume that the subspace contruction schemes produce partitions {r} satisfying the 
appropriate geometric conditions so that (|3.2cl) holds. Finally, we remark that an inter- 
mediate space Z such that X C Z, with continuous (even compact) embedding 

x(r) ^ z{r), (3.3) 

will sometimes play a critical role. It is assumed that Z has the same local structure as 
X and Y over a measure space (fi, S, /i), in that both (|3.2a|) and (|3.2b| ) hold for Z. The 
role of Z will usually be played by L^^^l) for suitably chosen exponent p. 



14 



M. HOLST, G. TSOGTGEREL, AND Y. ZHU 



3.2. The Algorithm: SOLVE-ESTIMATE-MARK-REFINE. We now formulate an 
adaptive algorithm based on enriching the local structure using error indicators, parti- 
tion marking, and partition refinement. Let := X(Tk) and Yk := Y(Tk) be the 
abstract finite element spaces defined on the partition Tk- Given an initial partition To of 
the domain, the adaptive algorithm for solving equation (|2.1I) is an iteration involving the 
following main steps: 

(1) nfc:= SOLVE 

(2) {rj{uk, t)Ut, ■■= ESTIMATE Tk) ; 

(3) Mk ■■= MARK({r/(Mfc,r)Ker.,rfc); ^'''^^ 

(4) Tk+i := REFINE (7;,A^fc,£), increment A;. 

We will handle each of the four steps as follows: 

• SOLVE: We use standard inexact Newton + multilevel solvers for equation (12.4!) 
to produce Uk E Xk on each partition % (cf. [|7l[30l|24J). To simplify the analysis 
here, we assume that the discrete solution Uk is the exact solution to (12.41 ). 

• ESTIMATE: Given a partition Tk and the corresponding output Uk E Xk of the 
SOLVE modules, this module computes and outputs the a posteriori error esti- 
mator {r7(nfc, r)},-^^;;) where for each element r G 71- the indicator r7(ufc, r) ^ 0. 

• MARK: Based on the a posteriori error indicators {r](uk,T)}reTk^ this module 
gives a strategy to choose a subset of elements AikOfTk for refinement. 

• REFINE: Given the set of marked elements Aik and the partition Tk, this pro- 
cedure produces a new partition Tk+i by refining (subdividing) all elements in 
^Ak i ^ 1 times. Some other elements in Tk \ M.k may also be refined based 
on some requirement of the partition, such as geometric relationships between 
neighboring elements (sometimes called geometric conformity) in order to sup- 
port construction of the spaces X(T). This procedure is known as completion. 

Now we state some basic assumptions on these modules, which will be used in the 
convergence analysis in Section IH 

3.2.1. REFINE. We suppose that refinement relies on unique quasi-regular element sub- 
divisions. More precisely, there exist constants ci, C2 G (0, 1) independent of the parti- 
tion T, such that any element r E T can be subdivided into n(r) ^ 2 subelements 
^^•••'<(r) such that 

n(T) 

r = n U ■ ■ ■ U r;(,), /i(r) = ^^/.(r;), (3.5a) 

i=l 

and 

ci/i(r) ^ /i(r^) ^ C2;u(r), i = 1, . . . , n(r). (3.5b) 

We define now the class Q admissible partitions of as the subclass of all partitions 
of n that satisfy the two properties: 

• The partition is subordinate to (a refinement of) %', 

• The partition is locally quasi-uniform in the sense that 

sup max #iVr(r) < 1, sup max -^y^ < 1, (3.5c) 

Teg ^er reg^'eAfr(^) 

where Nfij) := {r' E T\t' fl r 7^ 0} denotes the set of neighboring elements 
of r in T. 
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In addition, we suppose that the output partition 

T' := REFINE(r,M,£) 

satisfies the requirement 

\/t eMdT.T iV, (3.5d) 

that is, each marked element of the input partition is subdivided at least once in the 
output partition. Additional elements in T \ may be refined in order to fulfill some 
other requirements for partitions coming from class Q\ for example, properties such as 
geometric conformity may need to also hold in specific case of constructions of X(T) 
over T in order to ensure that (|3.2cl) holds. 

3.2.2. SOLVE. We assume that the abstract finite element spaces X(T) and F(T) build 
over T have the following two natural properties. Let T,T' E Q. The spaces X{T) and 
Y(T) are called conforming if 

XiT) C X and Y{T) C Y, and dimX(r) = dimy(r), (3.6a) 

and are called nested if 

if T' is a refinement of T then X{r) C X(r') and Y{r) C F (T')- (3.6b) 

We note that the underlying paritition T does not need to be geometrically conforming in 
order for the spaces built over T to be conforming in the sense of (I3.6al) . We also assume 
that the discrete inf-sup condition (I2.6bl) holds: 

inf sup b{x,y) = inf sup b{x,y) ^ (3i, (3.6c) 

x€X{T),\\x\\x=l j/gy(r),||y||y=l y'^Y{T),\\y\\Y='^ x&X{T),\\x\\x=l 

with some constant (3i > 0. In most conforming finite element spaces in Sobolev spaces, 
this is an immediate consequence of the usual interpolation error estimates, cf. [22]. In 
Theorem 12.61 for the well-posedness of the discrete equation, we require the space Xq 
satisfies (12.131) : 

||.-..|U«|N-(l.M)-'n,in{f,§}. (3.M, 

where = \\F'(u) ||£(x,y*), (3o, A are the inf-sup constants in (I2.6al) and (I2.6bl) respec- 
tively, L is the Lipschitz constant for F'{u) and 5 is the Lipschitz radius. Moreover, we 
suppose that the output 

ur:= SOLVE (x(r),r(r)) 

is the Petrov-Galerkin approximation of u with respect to (X(T), Y{T)) : 

ur e X{T) : {F{ur),v) =0, Vt; G Y(r). 

Thanks to (I3.6al) . (|3.6cl) and the assumption on the initial partition (I3.6dl) . by Theorem l2.6l 
the Petrov-Galerkin approximation ur exists, is unique, and is a || ■ ||x -quasi-optimal 
choice from X{T). 

3.2.3. ESTIMATE. Now we make some assumptions on the output 

MuT,r)},eT ■■= ESTIMATE(«r,r) 

for any admissible partition T E Q. First, we assume that the following estimate holds 
for the Petrov-Galerkin approximation uj- : for any subset S C T and v eY, 

{F{ur),v) <r]{ur,S)\\v\\Y{ns)+v{uT,T\S)\\v\\Y{nr\s), (3.7a) 
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where ri{ur, S) = \\{ri{ur, T)}Tes\\ep ^ Uaes f'" for 5 C T. We note that the 

estimate (I3.7al) implies the global upper-bound 

\\u-ur\\x <viur,T). (3.7b) 

Second, we assume the error indicator r]{ur, t) satisfies local stability. More precisely, 
there exists a function D E Z(^l) such that 

v{ur,r) < \\ur\\x{ujr(T)) + \\D\\z{cor{r))^ ^ (3-7c) 

where wr(T) C i7 is the patch (union) of elements in N-j-ir), and where the space Z is 
the appropriate auxiliary space as in (13.31) in Section [311 

Remark 3.1. We remark that the stability assumption (I3.7cl) is weaker than the local 
lower bound bound. As we can see from the examples in Section |5] one can obtain the 
stability estimate (|3.7cl) from the usual local lower bound estimates. 

3.2.4. MARK. We suppose that the output 

M:=M ARK {Mur,T)UT,r) 
of marked elements has the property 

r]iur,T) ^^{maxri{ur,a)), teT\M, (3.8) 

where ^ : ]R_|_ — ^ ]R_|_ is a continuous function satisfying ^(0) = 0. Most marking strate- 
gies used in practice satisfy (13.81) . For instance, the maximum strategy or equidistribution 
strategy, cf. [48]. In particular, the following Dorfler marking strategy also satisfies the 
assumption (13.81) : Given Q e (0, 1], a marked subset M. of elements is constructed to 
satisfy 

r]{urM)>Qr]{ur.r). (3.9) 

This marking strategy, which was proposed by Dorfler ll26l in his original AFEM con- 
vergence paper, is proven to be crucial in the proof of contraction, cf. ll46l[T6l . We refer 
to Section |7] for more detail. 

4. Convergence Analysis 

Based on the assumptions on the adaptive algorithm, and on the abstract framework 
discussed in Sections [2l we are now ready to state and prove the abstract convergence 
result based on a weak-* residual convergence. 

Theorem 4.1 (Abstract Convergence). Let ube a locally unique exact solution of (|2.1I) . 
Assume that the nonlinear operator F'{u) satisfies the inf-sup condition (I2.6al) and is 
Lipschitz continuous in a neighborhood ofu. Let {uk} be the sequence of approximate 
solutions generated by iteration (13.41) . 

If the finite element spaces (X^, Yk) satisfy (13.21) . and the modules REFINE, SOLVE, 
ESTIMATE, and MARK satisfy, respectively, (1331) . (IX6l) . (1377]) . and (IX8l) . then there 
exists Uoo e X such that lim Uk = Moo- Moreover, the sequence {uk} satisfies 

k—^oo 

lira {F{uk),v) = 0, Vf e F. (4.1) 

k—^oo 

Consequently, we have Uoo = u, that is lim — = 0. 

k—^oo 
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We split the partition Tk into two sets 7^^ and T^, where 

= {t eTk-.T e%, \/i^k} 

contains all the elements that will not be refined after k-th step, and = Tk \ Tj^ is the 
set of elements that will be refined at least once after A;-th step. Here the superscript '+' 
means the measure of the elements in Tf^ is positive. We denote 

= ilj-o ■■= [j r and ^]+ = := |J r. 

oo 

For simplicity, we denote = H^i*- 

i=0 

We note that the sequence {hk} C L°°{fl) of the meshsize function is bounded and 
monotone decreasing for a.e. x E Vl. Moreover, we have 

Lemma 4.2 ([|48l Corollary 4.5]). The sequences {hk} and {^1} satisfy 

lim WkkWio^mo, = 0. 

Now we are ready to prove Theorem 14.11 

Proof of Theorem \4J\ Theorem l2.8l shows the existence of the Petrov Galerkin solutions 
Uk G Xk and u^o e X such that 

lim Uk = Moo- 

fc— >oo 

If we can show (|4.1I) . that is, the residuals weak-* converge to 0, then Theorem 12.21 
implies that Mqo = u. Therefore, we need to prove (14.11) . Notice that Y is dense in Y, we 
only need to show that 

lim {F{uk),v) = 0, Vm G Y. (4.2) 

fc— ^-oo 

By definition, the sets Tj^ are nested, that is for any j ^ k, 

T+ C C Tk and = ^7-^^^-+. 

Applying the upper bound (|3.7al) with T = Tk and S = Tj^, for any v eY we have 

{F{uk),v) = {F{uk),v -v) < ri{uk, Tk \ Tj^)\\v - vWy^qO) + vi^k, Tj^)\\v - M||y(Q+), 

(4.3) 

where v is arbitrary in Y^. Given any e > 0, we need to show that for sufficiently large 
k and j, and for a suitable v G Yk, each term in the right hand side of the above estimate 
can be bounded by a multiple of e. 

By the local approximation assumption (I3.2cl) . there exists av := IjV EYjCYk such 
that 

So according to Lemma |43] for sufficiently large j, we have \\v — v\\Y{n'^) ^ §• On the 
other hand, it is easy to see that \ il°) — )■ as j — )• 00. Therefore, by (I3.2bl) for 
sufficiently large j one has ||f — v\\Y(no\^no) ^ §• Hence by (|3.2al) . we obtain 

11^ - ^\\y{q']) ^ 11^ - ^||y(no) + \\v - M||y(f^o\f^o) ^ e. 
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Notice that r](uk, % \ T^) is uniformly bounded because 



IP 



m 



< {\\Uk\\xMr))}^^j-^ 

< \\uk\\x + \\D\\z 

where in the second inequality, we used the inequality (|3.7cl) . and in the third inequality, 
we used (I3.5cl) and (|3.2al) . Now since lim \\uk — u^oWx = 0, for sufficiently large j ^ k 

k—^oo 

we have 

ri{uk,Tk \ Tj^) ^ 2||noo|U + II^IU- 
Therefore, the first term in the right hand side of (14.31) satisfies: 

v{uk,Tk\Tj^)\\v -v\\y(^n<^^) < (2||moo|U + \\D\\z)e. 

We fix this j and consider the second term in the right hand side of (14.31) . and let k ^ j. 
By marking strategy (13.81) . for all r G 7^^ C Tj^, we have 

and moreover for a E we have 

V{Uk,(r) < \\Uk\\x{uj,{a)) + \\D\\z{uj,{a)) 

^ ll^^fc - Uoo\\x(uJk{cT)) + \\Uoo\\x(uJkicT)) + 

The first term goes to zero because — t- Uoo- For the second and third terms, we notice 
that n^LOkicr)) — )■ as A; — )• oo by the locally quasi-uniformity (|3.5cl) and Lemma |431 
Hence || ■ Hx^M) ^ and || • |U(a;,(<T)) ^ as /^(^^(o")) ^ by (I3.2bl) . Therefore, we 
can choose k ^ j sufficiently large such that r](uk, T^) ^ s. Finally, we proved that 

lim = 0, G F. 

A;— >oo 

Therefore, (14.11) holds. This completes the proof. □ 

The convergence of — )• m as A; — )• oo in Theorem l4.1l does not imply the convergence 
of the estimator. It is indeed possible for the error indicators to be not efficient in the sense 
that they might contain strong overestimation. In other words, an efficient error indicator 
should be bounded by the error ||m — u^Hx in certain way. 

M . Ml with 



Theorem 4.3. Let there exist D E Z and a continuous function : M+ 
0(0) = 0, such that for any T E Q and t E T 

V{Uk, r) < \\U - UkWxir) + 0(^(^)) {\\Uk\\x{r) + ||^IU(r)) • 

Then under the hypotheses ofTheorem \4.1\ we have 

lim r]{uk,Tk) = 0. 

fc— >oo 

Proof. For A; ^ j by definition of r] and (14.41) we have 

Viuk, Tk) < ri{uk, Tk \ Tj^) + 'r]{uk, Tj^) 



(4.4) 



< 



+ 



\u 



Uk\\x{n^o)+v{uk,Tj^) 



{0(^(^)) (hfellx(r) + \\D\\z(^r))} 
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By Theorem 14.11 we have \\u — Uk\\x(nO) ^ \\u — u^Wx — t- 0, as A; — t- oo. The second 

term goes to zero since for any r E %\ Tj^, /i(r) — )• as j — )■ oo. We follow the same 
arguments as in Theorem l4.1l to show that the last term converges to zero. This completes 
the proof. □ 

Note that convergence of both the error (Theorem 14. II) and the estimator (Theorem 14. 3 1) 
are important. The convergence result in Theorem 14. 1[ lim ||m — m^Hx = 0, means that 

fc— )-oo 

the approximate solutions get arbitrarily close to the exact solution. However, this would 
be of little practical use without the second convergence result, namely lim 77 = 0, 

which is the computable counterpart of the first result and thus allows one to recognize 
the improvement of the approximate solutions. In particular, lim r] = ensures that if 

fc— >oo 

one includes a stopping test with a given positive tolerance, then the algorithm stops after 
a finite number of iterations. 



5. Examples 

In this section, we present some nonlinear examples, and apply the weak-* conver- 
gence framework developed in the previous sections to show convergence of the adaptive 
algorithm (13.41 ) for these problems. We consider a fairly broad set of nonlinear problems 
(see [|T3l [68l l57l for example), and show how the weak-* framework can be applied 
in each case. Specifically, we consider a specific semilinear problem with subcritical 
nonlinearity, the stationary incompressible Navier-Stokes equations, and a quasi-linear 
stationary heat equation with convection and nonlinear diffusion. Many other nonlinear 
equations are also covered by this general framework. 

We restrict polygonal (or polyhedral) domains C W'-, where d = 2, 3 is the space 
dimension; however, all of the results extend to more general domains with standard 
boundary approximation algorithms and analysis techniques. In the examples presented 
here, the function spaces X and Y are the Sobolev spaces W^'P{VI) with s ^ andp > 1, 
equipped with the norm || ■ \\s,p,n and semi-norm | ■ \s,p,n- The space Wq^{Q) is the closure 
ofV{n) in W''P{n), and W''''^{n) is the dual space of W^'^in) with 1 + 1 = 1. When 

p = 2, we shall denote H'{n) and H^{n) instead of W''^{n) and W^'^^n) respectively, 
with the norm || ■ ||s ^ and semi-norm \ ■ \s,n instead of || ■ \\s,2,n and | ■ \s,2,n- The space Z 
in the convergence analysis is taken to be U'{fl) for suitable choice of p. More detailed 
presentations of the Sobolev spaces can be found, for example in the monographs [|42l [T1l 
in the case of domains in M*^, or [|53l |29l [32ll in the case of manifold domains. These 
Sobolev spaces satisfy the subadditive assumption (13.21 ). 

Let the initial partition To of be conforming and shape regular. We restrict ourself to 
a shape-regular bisection algorithm for the refinement. There is a vast literature on bisec- 
tion algorithms; cf. O] [191 [63]| and the references cited therein. It is well known that 
the bisection algorithm as well as the shape-regularity of % guarantee assumption (13.51) 
holds for any partitions generated by the algorithm. Without loss of generality, we as- 
sume that ||/io||oo,Q ^ 1 is fine enough such that (|3.6dl) holds. 

Starting from the initial triangulation To, the adaptive algorithm (13.41) generates a se- 
quences of shape-regular triangulations {Tfejfc of fi, as well as a sequence of approximate 
solutions {uk}. For the marking strategy, the condition (13.81 ) is satisfied for example if we 
use Dorfler's strategy (cf. Il26l ) (13.91) , or the Maximum strategy (cf. [4]). Apart from the 
assumptions on mesh refinement and the marking strategy discussed above, for each in- 
dividual example below, we need to construct the specific finite element spaces C X 
and Yk CY which satisfy the conditions (13.61) . We also need to define the specific error 
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indicator r], which satisfies (I3.7I ). More precisely, according to Theorem 1431 and |43] we 
only need to: 

(1) Verify the continuous inf-sup condition (12 .Gal) and the uniform discrete inf-sup 
condition (I2.6bl) of the bilinear form &(■,■) defined by (12.51) : 

(2) Define appropriate error estimator i] which satisfies (13.71) : 

(3) Verify that t] satisfies (14.41) to prove the convergence of error indicator Theo- 
rem |4]3l We note that the standard local lower bounds for the error indicator will 
guarantee (14.41 ). 

In the remainder of this section, we will follow the general framework presented in 
Section [Z2l (cf . [|68l |69l ) to derive a posteriori error estimates for each example. We 
then verify the basic assumptions on the error estimators and the nonlinear equations. 
As a consequence, we then conclude convergence of the adaptive algorithm for each 
example. 

5.1. Semilinear Examples: Single Equations and Systems. In this subsection, we 
give two semi-linear examples. The general formulation of a semi-linear equation is 
as follows: 

F{u) := Lu + N{u) = 0, (5.1) 

where L : X — )■ F* is a bounded linear operator, and A^(-) : X ^ Z C Y* is a 
mapping from X onto a subspace ZofY*. We assume that 

(51) L satisfies the continuous as well as the discrete inf-sup conditions: 

inf sup {Lx,y) = inf sup {Lx,y) = ao > 0. (5.2) 

xeX,\\x\\x=l yeY,\\y\\Y=l !/Gy,||j/||y=l x€X,\\x\\x=l 

inf sup {Lx,y) = inf sup {Lx,y) = ai > 0. (5.3) 

xeXk,\\x\\x = l j/eYfe,||j/||y = l J/GYfc,||j/||y = l x£Xk,\\x\\x = l 

(52) The embedding Z cY* is compact as in (13.31 ). 

First of all, we establish well-posedness of the equation (15.11 ) under the above assump- 
tions on L and N. 

Theorem 5.1. Let F satisfy (SI) and (S2), and Xq satisfy (12.131) . If N'{u) is Lipschitz 
continuous in a neighborhood ofu, then the Petrov-Galerkin problem (|2.4I) possesses a 
unique solution Uk in a neighborhood ofu. Moreover, we have the error estimates 

\W-Uk\\x< inf \\u-Xk\\x- 

XfcG^fe 

Proof. It is straightforward to check that F'{u) is Lipschitz continuous. The following 
inf-sup condition was proved in [57, Theorem 5.1]: 

inf sup {F'{u)x,y) = inf sup {F'{u)x,y) = f3i > 0. 

xeX^,\\x\\x=l j/GYfe,||j/||v-=l yGYfc, ||s/||y=l ,^^Xk,\\x\\x=l 

Then the conclusion follows by Theorem |2.6[ □ 

Remark 5.2. The (global) existence and uniqueness of the solution can sometimes be 
proved by standard arguments in the calculus of variations. The a priori error estimate 
in Theorem I5.il can also be proved in a different way, if a priori estimates on the 
solution u and the discrete solutions Uk hold. We refer to Section^\for the details. 

Example 5.3. Consider the following semi-linear equation 

F{u) ■= -Au + u"^ - / = 0, (5.4) 

with homogeneous Dirichlet boundary condition u\dQ = 0. We assume that m ^ 2 is a 
constant f G L^i^Q) for some p > 1 satisfies p ^ d — 



d_ 
m ' 
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For this nonlinear equation, we define the linear and nonlinear components of F as 
Lu = -Am and N{u) = - f. We let X = W^'^in) with \\ ■ \\x = \ ■ |i,p,Q and 
Y = W^''^{^1) with the norm || ■ ||y = | ■ \ i,q.n, where q satisfies ^ + ^ = 1. By the Sobolev 
Embedding Theorem and the choice of m, we have W^'^^Vl) Therefore, 
for any u G Wq'^{Q), we have N{u) G L^i^), which is compact embedded in Y* = 
W~^'P{ri). A special case when m = 3 and p = 2 in can be found in Rappaz [57J. 

Given a conforming triangulation 7^, let Xjt C X and C F be the piecewise linear 
continuous finite element space defined on %. Then the finite element approximation of 
the equation (15.41) reads, 

find Uk G Xk, such that / Vuk ■ Vvk + M™ffc — fvkdx = 0, Vf^ G Y^- (5.5) 

Jn 

Based on Theorem 15. II and Theorem 12. 61 we have the following proposition. 

Proposition 5.4. If the Laplacian operator A : W^'^'iyt) W^^''^{Vl) is an isomor- 
phism, then for ||/io(a^)||oo,Q sufficiently small, the Petrov-Galerkin problem (15.51) have a 
unique solution Uk G X^ in the neighbor ofu, which satisfies the a priori error estimate 

||m - Mfc||i,p,n < min \\u - XkWi.p.n- 

Proof. It is straightforward to check F'(u) is Lipschitz continuous. By assumption on 
A, L = —A satisfies the continuous inf-sup condition (15.21) . That is, we have 

inf sup (Vw, Vf ) = inf sup (Vw, Vf ) ^ ao > 0. 

w€X,\\w\\x=l ^(zY,\\v\\y=1 v€Y,\\v\\y = 1 w€X,\\w\\x=1 

We need to show that L satisfies the discrete inf-sup condition (15.31) . Let : Wq^'^(O) 
Yk be the Galerkin projection, i.e., for any v G Wq''{Q) 

{Vwk, V{v - Pkv)) = 0, Wwk G Xk. 

It is well known that ||Pfcv||y < ||f ||y, Vf G F, see [l56ll for example. For any Wk G 
Xk with llwfcllx = 1, by the continuous inf-sup condition, there exists a function v G 
Wo'^in) with II v\\y = 1 such that 



2 ^ iVwk,Vv) = {Vwk^VPkv). 



Hence 



sup {Vwk,Vvk) ^ (vwk, -n^Ar- ) ^ ^ii d° ii ^ "i > 0, 

Vk&Y„\Mr=l V \\PkV\\Yj 2||Pfci;||y 

for some constant ai. In the last step, we used the stability of P^. This proves the discrete 
inf-sup condition: 

inf sup (V^^;fc, Vf fc) ^ ai > 0. 

The conclusion then follows by Theorem 15. II □ 

Let a = f n r' be the interface between two elements r and r' E %, and be a fixed 
unit normal of a. For any w E Xk, vje denote the jump residual on a as [Vw ■ Ua] . Now 
we define the local error indicator 

Viw, tY := hlWw"^ - fWl^^^ + 5^ II [Vw ■ n^]\\l^^^ , G X,, (5.6) 

crCdr 
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1 

and define ri{w, S) := {J^res V^i.'^i ''")) ^ for any subset S <zTk- We also introduce the 
oscillation: 

oscfc(r) := KWf - 7ro/||o,p,r, 

where ttq is the element-wise W projection; and denote oscfc(5) := (S^-g^ osc^(r)) 
for any subset 5 C Tk- Follow the general framework developed in ||69l , we have the 
following a posteriori error estimates. 

Theorem 5.5. Let u G Wl'^iVt) be a solution to (|5.4I) . and Uk G Xk be the solution of 
Petrov-Galerkin equation (|5.5I) . Then for any subset S C % and v EY,we have 

{F{uk),v) < r]{uk,S)\\v\\Y{Qs) +v{uk,Tk\S)\\v\\Y(nr^^s)- 

Consequently, there exists a constant Cq depending only on m and the shape regularity 
ofTk such that 

\\u-Uk\\i^p^n^CQri{uk,Tk)- (5.7) 

Furthermore, there exist constants Ci and C2 depending only on m and the shape regu- 
larity ofTk such that 

V{uk,r) ^ Ci\\u - Uk\\i,p,u^ + C20SCk{ojr). (5.8) 

Finally, based on these observations, the adaptive algorithm for the nonlinear equa- 
tion (15.41) is convergent. 

Corollary 5.6. The adaptive algorithm for the nonlinear equation (15. 4h converges, that 
is, Uk ^ u as k ^ 00. Moreover, we have 

lim ri{uk,Tk) = 0. 

k—^oo 

Proof. By Theorem 15.51 r](uk,Tk) satisfies (|3.7a| ). To show (|3.7c| ), by the local lower 
bound and triangle inequality, we obtain: 

r]{uk,r) ^ Ci\\u - Uk\\i,p,u^, + C20sck{ujr) 

< C'i||Mfc||i,p,<^^ + Ci||m||i,p,^^ + C2\\h{f - 7ro/)||o,p,a;, 
^ II ^fc II 1,P,cJt ~^ II II OjPjOJt- ; 

where D depends only on / and u. Here we use the fact that u E Wq'^{^1), i.e., 

||M||l,p,a;(r) ^ C 

for some constant C ^ 0. Also notice that the local lower bound (|5.8I) implies (I4.4|) . then 
the conclusion follows by Theorem 14. H and Theorem 14. 3 [ □ 

As a second example of a semi-linear equation involving a system of equations, we 
consider the stationary incompressible Navier-Stokes problem: 

Example 5.7. Consider 

-z/Au + (u ■ V)u + Vp = f in ncR"^ 

divu = inQ, (5.9) 
u = on dfl 

where v is a constant viscosity of the fluid and f G L^(i7)°' is the given force field. 
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For this example, we let X = F = ifg (fi)*^ x Ll(n) with the graph norm 

ll[v,g]||x= (||v||?,^+||g||y^ 



Let 



(L[u,p], [v, q]) = / z/Vu : Vv — pdivv — qdivudx 
Jn 

(7V([u,p]),[v,g]) = [{uV)uY-fvdx. 

Jn 

When d = 2, since u G Hq{^1)'^, by the Sobolev Embedding Theorem, we have u e 
LP{Q) for all 1 < p < oo. Hence, (u ■ V)u is in L'^{Qf for all 1 ^ g < 2. Similarly, 



when (i = 3, we notice that u G ^^(fi) for all 1 < p ^ 6. Therefore, (u ■ V)u is in 

|. If we set Z = L'i{nY for | < g < |, 



L'i{VLf for all 1 ^ g ^ |. If we set Z = L'^iVtY for | < g < |, the property (S2) of 



is true with = 2 or 3. 

On the other hand, the operator L is given by the Stokes problem. It is well-known 
that the continuous inf-sup condition is satisfied by L. Given the triangulation 7^, we 
denote the finite element space := x Qfc C X, and assume that there holds the 
following discrete inf-sup condition for Stokes operator L on Vk x Q^. : 

(divvfe,gfe) 

\\qkh,n < sup — — . (5.10) 

We refer to ll67l l28l [T2l| for construction of finite element spaces that satisfy (15.101) . By 
this construction, the linear operator L satisfies the assumption (SI). 

The Petrov-Galerkin approximation to the equation (15.91 ) is: Find [ufc,pfc] G Xk such 
that 

{F{[uk,Pk]), [vfc,gfc]) = 0, V[vfc,gfe] G Xk, (5.11) 

where (F([ufe,pJ), [vk,qk]) := {L[uk,pk], [vfc,gj) + {N{[uk,Pk]), [vk,qk])- Based on 
Theorem [5T| and the abstract framework in Section[2l we have the following proposition. 

Proposition 5.8. The Petrov-Galerkin problem (15.1 II) for the stationary Navier-Stokes 
equation has a unique solution [n^, Pk\ in a neighborhood of [u, p] . 

Error indicators for this equation have been developed by several papers, see [[T3l[68l 

[TOl . For any r G 7^, we define ri{\uk,Pk\, r) as 

r]{[\ik,Pk],rf := - z/Au^ + (uA:V)uk + Vpfc - f ||o,^ 
+ lldivufel'^ 



+ K IIK'^Vufc -Pk) -n 

and define the oscillation by 



0,T 

2 

0,0- ' 

Cdr 



oscfe(r) := h^Wf - 7ro/||o,r- 
As usual, for any subset 5 C 7^ we denote 

ri{[uk,Pk],S) := I ^r]'^{[uk,Pk],r) | andosCfc(5) = I ^oscfc(r)^ 
We then have the following a posteriori error estimates. 
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Theorem 5.9. Let [u, p] E X be a locally unique solution to (15.91) . and [\ik,pk\ G 
be the solution to the Petrov-Galerkin problem (15.1 II) . Then for any subset S G Tk and 
f G X, we have the nonlinear residual estimate 

(F([ufc,pfc]), [v,g]) < r7([ufc,pfc],5)||[v,g]||x(ns) 

+r]{[uk,Pkirk\S)\\[w,q]\\x[nr^^s)- 
Moreover, we have the following a posteriori error estimates: 

(l|u-Ufc||?^j^+ \\p-Pk\WnY ^ ^([ufc,Pfc],7;), 

?7([ufc,pfc],r) < (||u- Ufcll^^^^ + \\p-Pk\\l,^^Y +OSCk{uJr). 

Therefore, the adaptive algorithm for the nonlinear equation (15.71) is convergent. 

Corollary 5.10. Let [u,p] E X be a locally unique solution to (15.91) . and [\\.k,Pk] G Xk 
be the solution to the Petrov-Galerkin problem (15.111) at each adaptive step k. Then we 
have [uk,Pk] — ?• [u,p] as k ^ oo. Moreover, we have 

lim r]{[uk,Pk],Tk) = 0. 

fc— >oo 

Proof. The estimate (|3.7al) of the error estimator r]{[uk,Pk],Tk) follows by Theorem 15. 9[ 
Similar to Corollary 15. 61 we can easily show (|3.7cl) by the local lower bound and triangle 
inequality. The conclusion then follows by Theorem l4.1l and Theorem 14. 3 1 □ 

5.2. A Quasi-Linear Example. 

Example 5.11. We now consider a quasi-linear example, the stationary heat equation 
with convection and nonlinear diffusion: 

= -div(K(M)VM) + b ■ Vm - / = inVt, (5.12) 

with homogeneous boundary condition u\q^ = 0. We assume for all s G M, k{s) G 
C^(M) satisfies k{s) ^ a ^ and |/t'^'^(s)| ^ 7^, for I = 0, 1,2, for some constants 
a, 7o, 7i, and 72. We assume the vector field b G W^'°°{QY such that divb = 0, and 

Let X = W^'^in) and Y = PFo''^(fi), with i + i = 1. As before, we let Xk C X and 
Yk C Y he the piecewise linear continuous finite element space defined on Tk- The finite 
element approximation of the equation (15.111) reads, 

find Uk G Xk, such that / K{uk)Vuk ■ Vf ^ + hVukVk — fvkdx = 0, Vf^ G Yk. 

Jn 

(5.13) 

We have the following properties. 

Proposition 5.12. Let u G Wl'^iVt) be a locally unique solution to the equation (|5.12l) . 
Then the mapping F : X Y* is of class for 2 < p < 00, and F'{u) is an 
isomorphism form X to Y*. Moreover, if the Laplacian operator A : X ^ Y* is an 
isomorphism, then for \\hQ{x)\\oo,n sufficiently small, the Petrov-Galerkin problem (15.131) 
have a unique solution Uk & Xk. 

Proof. We refer to [fT3l for the proof of the first part of this proposition. The second part 
of the conclusion then follows by Theorem (|2.6I) . □ 
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For the a posteriori error estimator, we introduce 

7]{uk, rf := hP\\ - dYv{K{uk)Vuk) + b ■ Vuk - /||o,p,r 

+ Eacar ha II [K{Uk)VUk " u] , 

and define tlie oscillation by 

osc^(r) := K W - ^o) (-div(/€K)VMfc) + h-Vuk- f)\\l^^, 
+ ^K\\ [(/ - Tii)K{uk)Vuk ■ n] , 

aCdr 

where ttq and tti are the element-wise projections onto the Vo and Vi spaces respec- 
tively. Also, we denote 



ri{uk,S) := I ^r]P{uk,T) j andosCfc(>S) := | ^osc^(r) 

\rG5 / \re5 

for any subset S d Tk- Again, following the general framework in ||68ll69ll , we obtain a 
posteriori error estimates. 

Theorem 5.13. Let u G Wl'^iVL) be a locally unique solution to (|5.12l) . Then we have 

{F{uk),v) < ri{uk,S)\\v\\Y(ns) + v{uk,Tk\S)\\v\\Y(n^^^s), 

for any subset S C Tk and v eY. Furthermore, we have the following a posteriori error 
estimates: 

\\u - Uk\\i,p,n < v{uk,Tk), 

V{Uk,r) < \\U - Uk\\l,p,uj{T) + OSCk{uJr)- 

Finally, based on the results above, the adaptive algorithm for the quasi-linear station- 
ary heat equation (15.121) with nonlinear diffusion is convergent. 

Corollary 5.14. Under the hypothesis of Theorem 15.721 if p ^ 2d then the adaptive 
algorithm for the nonlinear equation (15.121) converges, that is, 

lim Uk = u, and Um ri{uk, Tk) = 0. 

fc— J-OO k—^OD 

Proof. Again, the error estimator r](uk, %) satisfies (|3.7a| ) due to Theorem 15. 131 Now 
we prove that Assumption (|3.7cl ) holds. We start with the local lower bound in Theo- 
rem[5ll 

V{Uk,r) < ||m - Mfe||l,p,a;(r) + OSCfc(Wr) 

^ lkfc||l,p,a;(r) + 1^^ || l,p,a;{T) + OSCfc(w^). 

Since b e W^^'^iVt), we have 

||(/-vro)/||o,p,r ^ ||/||0,p,., 

||(/ - 7ro)b ■ VMfc||o,p,r < ||VMfc||o,p,r- 
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On the other hand, we have 

II (/ - TTo) (-div(K(Mfc)Vnfc))||op^^ 
^ ||(/-7ro)/€'M||o^p^jVMfc|2 

^ Chr ll^^'Vufcllo^p^^ IVWfcl^ 
^ C72/lr||VMfe||o,p,r|VMfcP 
^ C72||VMfc||o,p,r||VMfc||o,p_^ 
< ||VMfc||o,p,r- 

In the third step, we used the boundedness of k"(s) and the fact that Vwfc is constant in 
r; in the fourth step, we used the assumption p > 2d; and in the last step, we used the a 
priori estimate of Uk, namely, 

||wA;||i,p,n ^ IIm — Mfc||i,p,n + llwlli^p^n ^ C 

for some constant C (cf. Theorem 12.61) . Similarly, by noticing k is uniformly bounded 
and the a priori error estimate of Uk, one can easily obtain 

II ~ ni)K{uk)Vuk ■ ^]||o,p,^ < ||Vufc||o,p,a;,. 

crCdr 

Therefore, we obtain the stability estimate 

Vi'^k, , t) < ll^^felll.p.o;, + II-DII 

where D depending only on u and /. The conclusion then follows by Theorem 14.11 and 
Theorem gJl □ 



6. An Abstract Contraction Framework 

In this section, we develop a second distinct abstract convergence framework to al- 
low for establishing contraction under additional minimal assumptions. The framework 
generalizes the AFEM contraction arguments used in [|45l [T6l [50l [35l [311 to general ap- 
proximation techniques for abstract nonlinear problems. The three key ingredients to the 
contraction argument are as in the existing linear frameworks: quasi-orthogonality, error 
indicator domination of the error, and a type of error indicator reduction. 

6.1. Quasi-Orthogonality. One of the main tools for establishing contraction in adap- 
tive algorithms is perturbed- or ^wa^Z-orthogonality. Let Xi C X2 C X and Yi C Y2 C 
Y be triples of Banach spaces, and consider (for the moment, arbitrary and unrelated) 
Ui G Xi, U2 E X2, and m G X. If X also had Hilbert-space structure, so that the native 

1 /2 

norm || ■ ||x on X was induced by an inner product ||-||x = ('5')x ' ^^'^ orthogonality 
were to hold (n — U2, U2 — ui)x = 0, then one would have the Pythagorean Theorem: 

11^ ~ ""illx = 11^ ~ ""allx + ||^^2 - ui\\x- (6.1) 

Quasi-orthogonality is a more general concept whereby one gives up orthogonality (|6.1I) . 
and instead works with inequalities involving a (semi-)norm ||| • ||| that could be the native 
norm || • ||x, or more generally could be an energy norm or semi-norm particularly suited 
to the problem at hand. From the triangle inequality in the Banach space X together with 
the discrete Holder inequality, one always has the following inequality: 

AIIm - Mill^ ^ \\u - U2\\x + \\U2 - Ui\\x, (6.2) 
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with A = 1/2. Quasi-orthogonality then refers to establishing the more difficult inequal- 
ity in the other direction to supplement (16.21 ): 

A\\u - Ui\\x ^ \\u - U2\\x + \\U2 - Ui\\x, (6.3) 

for some A ^ 1, which is convenient to write in the form 

\\u — U2\\'x ^ A||m — — 11^2 — Mlllx- (6.4) 

We wish now to develop conditions for establishing (16.41) . We will see shortly that it will 
be critical for us to be able to establish (|6.4I) with constant A close to one; this will only be 
possible if n — ^2 and U2 — ui are nearly "orthogonal" in some generalized sense, which 
implies that we must work with a norm related to the Petrov-Galerkin (PG) "projection" 
process, and may require that we work with a norm other than the native norm || ■ 
To this end, consider a continuous bilinear form •) on X x Y: 

6:XxF-^M, h{u,v) ^M\\u\\x\\v\\y. ^ueX^veY. (6.5) 

Assume h satisfies inf-sup conditions on X and Y: There exists /3o > such that 

inf sup h{u, v) = inf sup b{u, v) = f3o > 0. (6.6) 

ueX,\\u\\x=l y^Y,\\v\\Y=l veY,\\v\\Y=l ueX,\\u\\x=l 

In the subspaces Xk and Yk, A; = 1, 2, we assume b satisfies a discrete inf-sup condition: 
There exists a constant /3i > such that 

inf sup b{u, v) = inf sup b{u, v) ^ f3i > 0. (6.7) 

«GXfc,||i)||x=l y^Yk,\\v\\Y=l v€Yk,\\v\\Y=l u£Xk,\\u\\x=l 

Given now / G F*, we assume that u E X is the solution to the operator equation 
involving b and /, and that ui E Xi and U2 E X2 are corresponding PG approximations: 

Find ueX such that b{u, v) = f{v), eY. (6.8) 

Find Ml E Xi such that f 1) = /(vi), Vf 1 EY1CY2 CY. (6.9) 

Find U2 E X2 such that b{u2, V2) = f{v2), \/v2 E Y2 C Y. (6.10) 

With this setup, we can establish the quasi-orthogonality inequality in the norm || • ||x 
for PG approximations defined by any continuous bilinear form satisfying inf-sup condi- 
tions. 

Theorem 6.1. Assume the bilinear form 6 : X x y — )■ M satisfies the continuity (|6.5I) 
and inf-sup conditions (|6.6I) and (16.71) . Assume that u, ui, and U2 are defined by (|6.8I) . 
(16.91 ). and (|6.10l) . respectively. Then quasi-orthogonality (|6.4I) holds with 

( imV 

A= 1 + — ^1. (6.11) 



/3i 

Proof. With no inner-product we have only the following type of generalized "orthogo- 
nality" to exploit: 

b{u - U2,V2) = 0, \/V2 EY2CY, (6.12) 

b{u - ui,vi) = 0, \/vi EY1CY2CY, (6.13) 
which are obtained by subtracting (16.91) and (16.101) from (16.81 ). This leads us to: 

b{u - Ui,V2) = -b{Ui - U2,V2) = b{u2 - Ui,V2), V'U2 G ^2- (6.14) 

Combining (16.141) with the inf-sup condition (16.71) and continuity (|6.5I) gives the estimate 

on II ^ b{u2-Ui,V2) ^("~"1'^2) , ,.|| . 
Pl\\U2—Ui\\x ^ sup 7 7 = sup ^ M \\U—Ui\\x ■ (6.15) 

0jtv2eY2 ll'^2||y 0j^v2eY2 \\'^2\\y 
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Starting with the triangle inequality 

11^^ - U2\\x ^ \\u - Ui\\x + \\U2 - Ui\\x, (6.16) 

we add twice (16.151) to (16.161) to obtain 

11"" — U2\\x + \\u2 — ui\\x ^ A||m — Ui\\x, (6.17) 
with A = 1 + 2M/ /?! ^ 1. If we square both sides we obtain 

\\u - U2\\x + 2||^i2 - ^^illxll^i - '"2IU + 11^2 - '"illx ^ A^ll'u - nill^. (6.18) 
The second term on the left is non-negative; we drop it to give (16.41) with A = A^. □ 

It is clear from the proof of Theorem 16. II that to establish (16.41 ) with constant A ^ 1 
close one, one must establish a version of (|6.5I) that, when restricted to particular argu- 
ments from subspaces, will hold with constant M ^ close to zero. This then resembles 
some type of strengthened Cauchy inequality. Note that if X = Y, then the bilinear form 
■) defines an energy (semi-)norm through: 

b:X X X ^R, \\\wf = b{w, w), G X. (6.19) 

It will be more fruitful to consider quasi-orthogonality with respect to this semi-norm: 

III'" ~ ""211^ ^ A|||n — Uilp — 11^2 — t^ilp. (6.20) 
To establish (16.201) . it will be useful if strengthened Cauchy inequalities hold: 

b{u — Ml, f 1) ^ 7|||m — Mill |||fi|||, b{vi, u — Ml) ^ 7|||'U — mi||| |||mi|||, (6.21) 

VmiGXi, 7e[0,i). 
In this case, one immediately has the following without inf-sup conditions: 

Theorem 6.2. Let Xi C X2 C X be a triple ofBanach spaces, and let u E X, ui E Xi, 
and U2 G X2 be such that the bilinear form 6 : X x X — )■ M satisfies the strengthened 
Cauchy inequality (|6. 211) /or some 7 G [0, 1). Then quasi-orthogonality (|6.20l) holds in 
the energy semi-norm ||| ■ |p = ■) with A = 1/(1 — 7) ^ 1. 

Proof. We begin with the identity 

ll'U — Milp = |||n — n2|p + 11^2 — Milp 

+b{u - M2, U2 - Ml) + 6(m2 - Ml, m - M2). (6.22) 

Using (|6.21l) in (|6.22l) and Cauchy-Schwarz inequality gives 

|||m — Milp ^ |||m — M2IP + |||m2 — Milp — 7|||m — M2IP — 7|||m2 — wilp 

= (1-7) (|||M-t^2|ir+ 111^2 -Mif), (6.23) 
which gives (16.201) after multiplication byA = l/(l — 7)^1. □ 

While Theorem 16.21 gives quasi-orthogonality in the energy norm ||| ■ ||| without inf- 
sup conditions, it is important to point out that establishing the Cauchy inequality in the 
energy norm usually goes through the native norm || ■ \\x, and then relating the Cauchy 
inequalities in the two norms requires additional structure such as inf-sup conditions. To 
make this more clear, let us assume that X = Y and &(■,■) : X — )■ X is coercive for 
m = = I3i> Q : 

"^ll'^llx ^ lll'^IlP = ^(^5 Vw G X. (6.24) 
To allow for a weaker condition than coercivity (16.241 ), it is useful have a (Gelfand) triple 
ofBanach spaces X C Z C X*, with continuous embedding of X into the intermediate 
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space Z. A Garding inequality with m > and Cq ^ for the form ■) is then a 
possibility: 

"^Ikilx ^ llkf + Cclklll, Vw e X. (6.25a) 

If Cg = 0, then inequality (I6.25al) reduces to (I6.24h . To exploit (I6.25al) . we need a lifting 
inequality between X and Z when G X2 and ui G Xi are approximations of n G X: 

\u-u2\z ^CL(y\u-U2\x, \u2-ux\z ^CL(yQ\u2-U}\x, (6.25b) 

where it is assumed that oq = ca for fixed c > 0, and that a can be made arbitrarily small 
for sufficiently large subspaces Xi C X2 C X, where typically ui G Xi and M2 G X2 are 
PG approximations to m G X. Inequalities (|6.25bl) can be established using the "Nitsche 
trick" with certain regularity assumptions; cf. (451 [351. The usefulness of (16.251 ) is made 
clear by the following Lemma [63l due essentially to Schatz [|60l . 

Lemma 6.3. Assume the bilinear form 6 : X x X — )• M satisfies (I6.25al) - (l6.25bl) . 
Then (16.241) holds with w = u — U2 and w = U2 — Ui for a sufficiently small, with 
constant m = m ~ CcC'la'^ > 0. 

Proof. We observe that 

m||n — M2||x ^ — n2|p + CgII^^ — ^^2||| ^ |||^^ — M2IP + CGC'|o"^||n — M2||x, (6.26) 

which implies the result for a > sufficiently small. We note that < m ^ m, with 
m = m when coercivity holds (Cc = 0). The same argument for U2 — Ui gives the same 
result with slightly different constants appearing in the argument. □ 

6.2. Global Quasi-Orthogonality for Semilinear Problems. We now consider a non- 
linear problem for which we can establish a strengthened Cauchy inequality, and then 
subsequently quasi-orthogonality, globally in X. We use a Lifting-type argument requir- 
ing the PG approximation space be sufficiently good (large). Such an approach is used 
in [45] for nonsymmetric linear problems, and in ll35l[3ll for semilinear problems. 
Let Xi C X2 C X be a triple of Banach spaces, and consider F : X — )• X* such that 

F{u) = Lu + N{u), LgL(X,X*), X : X ^ X*. (6.27) 

The operator L induces a bilinear form 6 : X x X — )■ M and subsequently an energy 
(semi-) norm ||| • ||| : X ^ M, through the relations 

b{u, v) = {Lu, v), Vm, V eX, \\\u\\\ = b{u, Vm G X. (6.28) 

We have the equations for u G X and its PG approximations ui G Xi and ^2 G X2: 

b{u,v) + {N{u),v) = 0, yveX, (6.29) 
b{ui,vi) + {N{ui),vi) = 0, Vt;i G Xi C X2 C X, (6.30) 
b{u2,V2) + {N{u2),V2) = 0, Vt;2GX2CX. (6.31) 

We will need the following Lipschitz property (globally in X) for term X(-): 

{N{u) - N{u2),V2) ^ K\\u-U2\\z\\v2\\x, Wv2eX2, (6.32) 

where u is the exact solution and U2 is the PG approximation in X2, and where Z is 
part of the triple X C Z C X* as in Section 16. 1[ By splitting F into a linear part L 
satisfying continuity and Garding assumptions, and a remainder X satisfying only the 
Lipschitz assumption, we will be able to establish both Cauchy inequalities and subse- 
quently quasi-orthogonality, globally in X, for a large class of nonlinear problems. 
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Theorem 6.4. Let u, Ui, and U2 satisfy (I6.29I )- (I6.31| ). and let the Lipschitz (16.321) and 
Lifting (|6.25b|) conditions hold. Let the energy norm ||| • ||| induced by b{-,-) as in (16.281) 
satisfy the Gdrding inequality (I6.25al) . Then for a sufficiently small, b{-,-) satisfies the 
strengthened Cauchy inequality (16.211) with 7 = KCi'j/ijn — CcCj^cr^) G (0, 1), and 
the quasi-orthogonality inequality (16.201) holds with 

^^l^^ 1 - KCLa/{m - CcCla^) ^ ^^'^^^ 

For sufficiently small a, 7 can be made arbitrarily small and A can be made arbitrarily 
close to one. 

Proof. Subtracting (16.291 ) and (|6.31l) with v = V2 = U2 — ui, have 

\h{u- U2,U2- ui)\ = \ - {b{u) - b{u2),U2 - Ui)\ 

< K\\u - U2\\z\\u2 - UiWx 
^ lx\\u ~ U2\\x\\u2 - Ui\\x, 

after using (|6.32l) and (I6.25bl) . where 7^ = KC^a G (0, 1) for a sufficiently small. 
Since ■) satisfies the Garding inequality (I6.25al) . then by Lemma [63l we have 

\b{u- U2,U2- ui)\ ^ 7|||m - M2IIIIIIM2 - ^iilll, 

where 7 = 'yx/rn = KCL(y/{m — CcC'la'^) E (0,1) for a sufficiently small. By 
Theorem 16.21 we have (16.201) holds with A as in (16.331 ). which can be made arbitrarily 
close to one for cr > sufficiently small. The conclusion then follows by Theorem 16. 2[ 

□ 

6.3 . Local Quasi- Orthogonality for General Nonlinear Problems. Consider now gen- 
eral operators F : X — )• X*, where Xi C Xi C X is a triple of Banach spaces. We have 
equations for m G X and its PG approximations in ui G Xi and M2 G X2: 

{F{u),v) = 0, \/veX, (6.34) 

{F{ui),vi) = 0, Vt;i G Xi C X2 C X, (6.35) 

{Fiu2),V2) = 0, Vt;2GX2CX. (6.36) 

Not having access to any additional structure in F to exploit as in the semilinear case, 
we will need to work locally in an eo-ball around u G X for some cq > 0: We assume 

||M-MilU^eo, ||m - MolU ^ Co- (6.37) 

We assume F' is Lipschitz in the ball: There exists a Lipschitz constant L > such that 

||F'('u) — F'{w)\\c{x,x*) ^ L\\u — wWx, Vw G X s.t. \\u — w\\x ^ eo- (6.38) 

Define now the bilinear form 

6 : X X X ^ M, b{w, v) = {F'{u)w, v). (6.39) 

We then have the Cauchy inequality locally in the co-ball, leading to quasi-orthogonality. 

Theorem 6.5. Let u, Ui, and Uq satisfy (I6.34| )- (|6.36l ). and let the Locality (16.371) and 
Lipschitz (16.381) conditions hold, with &(■,■) defined as in (16.391 ). 

(1) Ifb{-, ■) satisfies coercivity (16.241) . then ■) satisfies the Cauchy inequality (16.211) 
with 7 = eo-L/(2m) G (0, 1), and quasi-orthogonality (16.201 ) holds for eo > 
sufficiently small with 

1 
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(2) Ifb{-, ■) satisfies Garding (|6.25a|) and the Lifting (|6.25b|) inequalities, then b{-, ■) 
satisfies the Cauchy inequality (16.211 ) with 7 = eo-L/(2[m — CcCla^]) G (0, 1), 
and quasi-orthogonality (16.201) holds for cq > and cr > sufficiently small with 

^ = 1 , /.or ^ 2n ^ 1- (6.41) 

l-eoL/(2[m-(:7GC^(T2j) 

In either case, the constant 7 can be made arbitrarily small, and the constant A can 
be made arbitrarily close to one, for sufficiently small eg and a. 

Proof. Subtracting (16.341) and (|6.36l) with v = V2 = U2 — U\ G X2 we have 

(FH-F(n2),U2-ni) = 0. (6.42) 
We also have the mean-value formula: 

F{u + w) = F{u) + F'(m)w + I \F\u + sw) - F\u)\ w ds. (6.43) 

Jo 

Using (|6.43l) with w = U2 — u together with (16.421) gives: 

- U2, U2 - ^^l)| = \ - b{u2 - U,U2 - Ui)\ 



= \ - {F'{U){U2 - U),U2 - Ui)\ 

= \ - {F{u + [U2 - u]) - F{u),U2 - Ui) 

+ ( / [F'{U + S[U2 - U]) - F'{u)] {U2 - u) ds, 

Jo 

^ (^j^ \\F\[l-s\u + su2)-F\u)\\c(x,x*)d^ 



U2 - 



■||^^ - U2\\x\\U2 - UxWx- 

Using now (16.381) and (16.371) we can establish the Cauchy inequality (|6.21l) as follows: 

- U2, U2 - ni)| ^ CL||?i-M2||x / -s (is^ ||m - U2||x||'"2 - '"ilU 

X, 



^ 7x||m - M2||x||m2 - Mil 



where 7x = eoiv/2 G (0, 1) for eo sufficiently small. 

If ■) satisfies the coercivity inequality (16.241) for m > 0, then we have established 
the Cauchy inequality (16.211) . with 7 = 7x/w^ = ^oL/{2m) G (0, 1) for eo sufficiently 
small. By Theorem 16. 2[ we have (16.201) holds with A as in (|6.40l) . which can be made 
arbitrarily close to one for eo > sufficiently small. 

Instead of coercivity, if &(■,•) satisfies the Garding (|6.25a|) and lifting (|6.25b| ) in- 
equalities, then by Lemma [63] we have established the Cauchy inequality (16.211 ), with 
7 = ^x/{m - CcCla^) = eoL/[2(m - CcCla'^)] G (0, 1) for eo sufficiently small. By 
Theorem 16.21 we have (16.201) holds with A as in (16.411 ), which can be made arbitrarily 
close to one for eo > and cr > sufficiently small. □ 

6.4. Contraction. We now establish a contraction result for approximation techniques 
for nonlinear equations on Banach spaces, which is an abstraction of the contraction 
arguments in |l451ll6l[50l|35l[3ll|. Let Xi C X2 C X be a triple of Banach spaces, let 
u E X, and let ui G Xi and U2 G X2 be approximations to u. We are interested in the 
quality of the approximations; as such, the following three distance measures between 
the three solutions are of fundamental importance: 

62 = \\u - U2\\x, ei = ||m - Millx, El = \\U2 -Ui\\x, (6.44) 
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where || ■ ||x is a norm on X; this could be either the native Banach norm, or more 
generally a norm associated with a problem- specific bilinear form. We are interested in 
approximation algorithms which involve abstract "error indicator" functionals that will 
be taken later to be practical implementable a posteriori error indicators commonly used 
in AFEM algorithms: 

r^i : Xi ^^ M, r]2 : X2 ^ M. (6.45) 

When written without arguments, these functionals are taken to be evaluated at ui and 
U2 respectively, and represent approximations to the error: 

?7i = ?7i(mi) ~ ei, r]2 = r]2{u2) ^ €2- (6.46) 

In order to build a contraction argument involving the errors, we will need three funda- 
mental assumptions relating the five quantities above: 

Assumption 6.6 (Quasi-Orthogonality). There exists A ^ 1 such that 

el ^ kel - El. (6.47) 

Assumption 6.7 (Upper-Bound). There exists Ci > such that 

el^Cvl k = l,2. (6.48) 

Assumption 6.8 (Indicator Reduction). There exists C2 > and u G (0, 1) such that 

ril ^ C2EI + (1 - uj)rH. (6.49) 

Using these three assumptions, we have the following abstract contraction result. 

Theorem 6.9 (Abstract Contraction). Let Xi C X2 C X be a triple of Banach spaces, 
let li G X, let Ui G Xi and U2 G X2 be approximations to u with error defined as 
in (16.441 ). and let rji and rj2 be error indicators as in (16.451 ). Let the Assumptions \6.6\ \6.7\ 
and \6.8\ hold. Let f3 G (0, 1) be arbitrary, and assume the constant A in Assumption \6.6\ 
satisfies the bound: 

1 ^ A < 1 + (6.50) 

Then there exists 7 > and a G (0, 1) such that: 

el + ^r]^^^a^el + ^r]l), (6.51) 
where 7 can be taken to be anything in the non-empty interval 

(A-i)Ci . r 1 ACi] ,,,,, 

< 7 < mm <^ — , — ^ } , (6.52) 



I3lo ' [C2 f3u 

and where a is subsequently given by o? = max{a^, } G (0, 1), with 

0<al = A- ^ < 1, < al = (3]uj < I. (6.53) 

Proof. Beginning with Assumption |6.6l we have for any 7 > 0, 

el + ^Tjl ^ Ael - El + ^rjl (6.54) 
Using now Assumption |6.8[ we have 

el + W2 ^ Ae^ -El + ^ [C2EI + (1 - co)ril] . (6.55) 

Assume now that < 7 ^ I/C2. In this case, the negative term involving Ef dominates 
the positive term, which implies: 

el + 77/2 ^ ^ _ ^)^2^ 
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We now split the negative contribution involving r]f into two parts, using any (3 G (0, 1): 

el + ^ Ae? - I3ujWi + 7(1 " [1 " I^PWi- (6.57) 
We now finally invoke Assumption l6.7l on the first term involving /3: 

el + ^r^l ^[k- ^) el + 7(1 - [1 - P]uj)ril = «?e? + a^vf, (6-58) 

where al and a"^ are as in (16.531) . Note tu E (0, 1) from Assumption l6.8l and also for any 
(3 e (0, 1) it holds 1 - /3 e (0, 1) and [1 - (3]co E (0, 1). Therefore, for any (3 E (0, 1) 
we have that al satisfies the second inequality in (16.531 ). It remains to determine 7 > 
so that < < 1, with as given in (16.531 ). leading to 

^ < 7 < ^. (6.59) 

We have already imposed 7 > and 7 ^ I/C2. Recalling A ^ 1, to ensure a{ E (0,1) 
we must have 7 in the the interval (|6.52l) . If A = 1, this interval is clearly non-empty 
for any (7i > 0, C2 > 0, /3 G (0, 1), and cu G (0, 1). If A > 1, since the term involving 
A in the upper-bound always dominates the lower bound, to ensure the interval for 7 is 
non-empty we must restrict A so that (A — l)Ci/{l3oj) < I/C2. This holds if A lies 
in the interval (16.501) . We now simply note that this interval for A is non-empty for 
any Ci > 0, (^2 > 0, /3 G (0, 1), and u E (0, 1). To finish the proof, we now take 
= max{af, 0:2} G (0, 1). □ 

We now establish the main contraction and convergence result we are after. 

Theorem 6.10 (Abstract Convergence). Let {X^j^^, Xk C Xk+i C X, VA; > 0, 

be a nested sequence of Banach spaces. Let u E X, and let {m^I^q be a sequence 
of approximations to u from X^. Let the Assumptions \6.6\ \6.7\ and \6.8\ hold with the 
same constants A, Ci, C2, and u, for any successive pair of approximations and 
Uk+i and their corresponding error indicators rj^ and rjk+i. Let a, (3, 7, and A be as in 
Theorem \6.9\ Then the sequence {uk}'^=i contracts toward u E X according to: 

el+i + Wk+i ^ {el + ivl) , (6.60) 
and therefore converges to u E X at the following rate: 

el + ivl^Ca'^ (6.61) 
for some constant C = C{ui, rji, A, Ci, C2, a, (3, 7, u). 

Proof. Both results follow immediately from Theorem |6.9l □ 

7. Convergence Based on Contraction and Some Examples 

Here we use the abstract contraction result (Theorem 16.91 ) established in Section [6] to 
prove a contraction result (Theorem 17 . 61 below) for the adaptive algorithm described in 
Section [3l Theorem |6.9| was based on three core assumptions: Quasi-orthogonality, Indi- 
cator domination of the error, and Indicator Reduction. We showed how to establish the 
first of these, namely the Quasi-Orthogonality Assumption 16. 6[ for PG approximations 
for two general classes of nonlinear problems in Section 16.11 The second assumption, 
namely the Indicator Domination Error Assumption |6.7l is a standard result for residual- 
type indicators; our adaptive algorithm produces indicators with this property, cf. (|3.7b| ). 
We focus on establishing the third assumption, namely the Indicator Reduction Assump- 
tion [6]8l in Section ITTI below. and then prove the main contraction result in Theorem 17. 6 1 
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for the adaptive algorithm (13.41) . based on Theorem 16.91 We then apply this contraction 
result to several nonlinear PDE examples in Sections |7!2l - |7.4[ 

7.1. Contraction of AFEM. What remains in order to use the abstract contraction re- 
sult in Theorem 16.91 for AFEM is the third assumption, namely the Indicator Reduction 
Assumption 16. 8[ Following lfT6l |50 ]. we will first reduce establishing Assumption 16.81 
to a simpler local Lipschitz assumption on the indicator, namely Assumption 17 . 1 1 below. 
Establishing Assumption 16.81 will then reduce to an assumption on the marking strategy 
in the AFEM algorithm; we satisfy this assumption by using the standard Dorfler strat- 
egy (13.91) . Admissible discrete functions in Assumption 17. II refer to discrete functions 
which are known a priori to satisfy specific properties of discrete PG approximations, 
such as discrete a priori bounds. We will later show how to establish As sumption |7 . 1 1 for 
several nonlinear problems in Section [5] using continuous and discrete a priori bounds. 
To simplify the presentation below, we will denote 

Cfc = - Mfclll, Ek = \\uk - Uk+il, 

Vk = viuk, Tk), rik{Mk) = viuk, Mk), ?7o(D) = ^70(0, %), 
where D represents the set of problem coefficients and nonlinearity. We also denote 
Vk := VoiTk) for simplicity. 

Assumption 7.1 (Local Lipschitz). Let T be a conforming partition. For all t E T and 

for any pair of admissible discrete functions v,w G X{T), it holds that 

\r]{v,T) -r]{w,T)\ ^ Air]{I),T)\\v - w\\i^2,uj^, (V.l) 

where Ai > depends only on the shape-regularity of%, and where ri(D, r) depends 
only on appropriate norm behavior of the equation coefficients over the local one-ring of 
elements surrounding t, and on the Lipschitz properties on r of the nonlinearity acting 
on admissible functions in X{T). The parameter ri{Y), r) is assumed to be monotone 
non-increasing with mesh refinement. 

Based on Assumption l7.1[ we have the following indicator reduction result (see also ll35l 
[3TI ). which extends the linear case appearing in [|T6ll50l to the nonlinear case. The proof 
is essentially identical to that of [16, Corollary 4.4], except that it allows for nonlinearity 
in Assumption 17. 11 we include it for completeness. The main difficulty in the nonlinear 
case will be establishing Assumption 17 . 1 1 and simultaneously satisfying the assumption 
on the parameter A appearing in Lemma 17^ 

Lemma 7.2 (Nonlinear Indicator Reduction). Let T be a partition, and let the param- 
eters 9 G (0,1] and i ^ 1 be given. Let M = MARK{{7]{v,T)}reT,T,6), and 
let % = REFINE{T,M,i). If Ai = {d + l)Al/i with Ai from Assumption^ and 
A = 1 — 2^'^^/'*) > 0, then for all admissible v G X{T), G X{%), and any S > 0, it 
holds that 

r,\v,, %) ^ (1 + 5)[r,\v, T) - \r,\v, M)] + (1 + 5-^)A^r,\Ti, %)\\v. - vf. 

Proof. The proof follows that in [[T6 i Corollary 4.4], with minor adjustment to allow the 
Lipschitz parameter in Assumption 17. II to depend on point- wise behavior of admissible 
functions in an interval; we outline the argument here for completeness. Using 
Assumption |7 . 1 1 with v and taken to be in X(7Z), gives 

Vi^*,'^*) ^ vi^^T*) + Aii](D,T^)\\v^ - v\\i^2,ujr, Vr* G %. 
After squaring both sides and applying Young's inequality with arbitrary 5 > we have 

v\v*,n) ^ {l + 6)r]\v,n) + (1 + r^)AV(D,^*)ll^* -^llUo;,, Vr, G %. 
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We now sum over the elements E %, using the fact that for shape regular partitions 
there is a small finite number of elements in the overlaps of the patches cUr, that are 
multiply represented in the sum. This gives 

V\v,, %) ^ (1 + 5)ri\v, %) + (1 + r i)A?r/2(D, T*) \\v. - vf, 

where we have used equivalence between the energy norm and the norm on (based 
on either coercivity or a Garding inequality together with lifting; cf. Lemma [6Jl) . and 
then absorbed both the norm equivalence constant and the finite over-representation of 
elements in the sum into the new constant Ai. 

Now take admissible v E X{T)\ a short argument from the proof of Corollary 4.4 
in [fT6l gives 

Tl\v,%)i^rf{v,T\M) + 2-^'/''Wv,M)=rf{v,T)-\Tl\v,M). {12) 

Finally, monotonicity r7(D, %) ^ ^^(D, To), combined with (17.21 ) yields the result. □ 

Remark 7.3. The difficulty in the nonlinear case will be establishing Assumption f7J] and 
simultaneously satisfying the assumption on the parameter A appearing in Lemma \772\ In 
the case of problems for which we can control the nonlinearity using a priori control 
of solutions and discrete approximations, we will be able to establish Assumption \7.1[ 
several such examples of increasing difficulty are analyzed in Sections \7.2[]7.4\ The 
assumption on A appearing in Lemma \7.2\ is essentially the assumption that the residual 
indicator contains only terms that decay as with 11°" for some a > 0. 

We will now make use of the Dorfler marking strategy (13.91 ). This simple marking 
strategy will ensure that the abstract indicator reduction As sumption [6]8] holds. 

Lemma 7.4. Let the conditions for Lemma \7.2\ hold. Let the Dorfler marking prop- 
erty (13.91 ) hold for some 9 E (0, 1], and restrict 6 > in Lemma \72\ so that 

0<«<^. (7.3) 

Then Indicator Reduction Assumption holds with C2 = (1 + (5~^)Ai?7^(D, To) cind 

uj = 1- {1 + 6){1- Xe^) E (0,1). (7.4) 
Proof. By Lemma 1712] we have for any 5 > 0: 



V\v,,%) ^ (1 + 6)[r]\v,r) - \r]\v,M)] + (1 + 6'')A,r]\-D,ro)\\\v. 



v\ 



l|2 



The Dorfler marking property (13.91) gives 

V'{v,, %) ^ (1 + 5)(1 - \e^W{v, r) + (1 + ri)Air72(D, ro)|||i;. - vl\ 
which we will write as 

with 

rjk+i = r]{v^,%), r]k = r]{v,T), Ek = fv^ - vf, (7.5) 
C2 = (1 + ri)Air72(D, To), (1 - a;) = (1 + 5)(1 - A^^). (7.6) 
To ensure that w = 1 - (1 + 5)(1 - XO"^) E (0, 1), we restrict 5 > so that 

< (1 + (5)(1 - A02) < 1, (7.7) 

or so that 

1 r 1 . i-[i-\e^] xe^ 

Since we must also take 5 > 0, we have then the range for 5 is as in (17.31 ) to ensure 
Assumption[6jwith w = 1 - (1 + (5)(1 - A^^) ^ (q, 1). □ 
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Remark 7.5. By first establishing Theorem \6.9\ based only on three simple assumptions 
relating the error and error indicator, the main contraction argument in Theorem \6.9\ 
is general, applies to nonlinear problems, and does not involve details of the adaptive 
algorithm that produces the approximations or the error indicators. The local Lipschitz 
and marking assumptions we use above to establish the indicator reduction assumption 
bring in the details of the particular adaptive algorithm and the problem only at the 
last moment, and helps clarify the impact of the various parameters on the contraction 
argument and rate. 

The supporting results we need are now in place; we can now establish the second of 
the two main convergence results of the paper, this one concerning contraction. 



Theorem 7.6 (Contraction). Let Assumption ( Ouasi-Orthogonality) and As sump - 
tion 1(5. 71 ( Upper-Bound) hold, and assume that the conditions of Lemma \7.4\ hold. Let 
/3 G (0, 1) be arbitrary, and assume the constant in Assumption \6^ satisfies: 

1 ^ A < 1 + -f^, (7.9) 

where the constant Ci is as in Assumption \6.7\ and C2 and u are as in Lemma [Z4l Then 
there exists 7 > and a G (0, 1) such that: 

\\u - Uk+if + Wk+i ^ - Ukf + 7^fc) , (7-10) 

where 7 can be taken to be anything in the non-empty interval 

(A-l)Ci r 1 kCA 

< 7 < min<^ — ,— ^ L (7.11) 



I3uj ' \C2 (3uj 

and where a is subsequently given by 

< = max{al, ol\} < 1, (7.12) 



with 



0<a2_A-^<l, 0<al = l-[l- (3]u <1. (7.13) 

Proof. By Lemma l74l As sumption 17 . 1 1 and Property (13.91 ) together imply that Assump- 
tion [6]8] holds. The result then follows by Theorem |6.9[ □ 

We now apply the Contraction Theorem 17.61 to establish contraction of the adaptive 
algorithm (13.41) for specific nonlinear PDE examples. Note that the more general weak*- 
convergence framework is also applicable to each of these examples, as we discussed 
in Section [51 what we gain here is fixed-rate contraction of the error at each iteration 
of AFEM, and subsequently the possibility of establishing optimality of AFEM. In each 
of the following examples, we use the standard residual error indicator, denoted by 77. 
For the marking strategy in the AFEM algorithm, we use the standard Dorfler marking 
strategy (IX9l) . 

7.2. A Semi-Linear Example. Our first example is a special case of equation (I5.4I ). 

Example 7.7. Let Q G M."^ be a convex polygonal domain, and f G L'^{Q). Consider the 
weak form formulation of the semi-linear equation (15.41) 



Findu G HliVt), s.t. (Vm, Vw) + {u^,v) = {f,v), Vi; G H^{n). (7.14) 
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Here, the solution and test spaces are the Hilbert space X = Y := HliVt). Let = 
Yfc C -f^o(f^) be the continuous piecewise linear finite element spaces defined on Tk, 
which we assume to be an exact partition of Vl. For convenience, we denote a(u, v) := 
(Vm, Vf ) and N(u) = u^. It is not difficult to see that 

{N{u) - N{v),u-v) ^ 0, yu,veH^{n), (7.15) 

\\N{u)-N{v)\\ciHHn),H-Hn)) < h - v\\o,n, Vm, G (7.16) 

The Galerkin approximation of the equation (17.71 ) then reads 

Find Uk e Xk, such that a{uk,Vk) + {N{uk),Vk) = if,Vk), Vf^ G X^. (7.17) 

Existence and uniqueness of solutions to (17.141) and (17.171) follow by standard variational 
or fixed-point arguments, cf. [i64.,38 |. To establish both a priori and a posteriori error es- 
timates, we will need L°° control of the continuous solution u and as well as the discrete 
solutions Uk- 

Lemma 7.8 (Continuous A Priori Estimates). Let u G HliVt) be the exact solution 
todLH). Thenue L^{Q). 

Proof. We split the solution u = v} + u^, where v} is the solution to the linear equation 

Since ^7 is convex, elliptic regularity theory implies G H'^{VL) fl ifQ(f2), hence v! G 
L^iyi). It remains to show that m" G Using arguments similar to Il37ll20l . define 

a = arg max < (c + supw')^ ^ L /3 = arg min < (c + inf u')^ ^ L (7.18) 
c I ) c t J 

Let = (m" — /3)+ := maxjw" — /3, 0} and = (m" — a)^ := min{M" — a, 0}. Then 
obviously 0, G i/g (Q). Hence, for = or = we have 

(Vu", V0) = + n')^ 0) ^ 0. 

This implies ^ || V0|| ^ 0, so = 0. Thus a ^ ^ /9 almost everywhere in Vt. □ 

In order to establish a priori L°° bounds for m^, we require the mesh satisfy the regu- 
larity condition 

a,j= j V0,V0, ^0, (7.19) 



See for example fW\ for a discussion of this condition. We then have the following a 
priori estimate for the discrete solution Uk- 

Lemma 7.9 (Discrete A Priori Estimates). Let Uk G Xk C -f^o(^^) be the exact solution 
to (|7.17l) . Assume the triangulation Tk offl satisfies (17.191) . Then Uk G L°°(f2). 

Proof. See dSTllia. □ 

Lemma IT^ and Lemma 17191 provide a priori L°° bounds for u and Uk- That is, if u and 
Uk are exact solutions to (17.141 ) and (17.171 ), then they must satisfy 

u_(x) ^ u{x),Uk{x) ^ u^{x), for almost every X G ^l, 

where u_,u+ G are fixed a priori bounds. In other words, we know that any so- 
lutions u and Uk to (17.141) and (17.171) can be found in fl ifg (fi), so that we do 
not have to look in the larger space Hq^VI) for u and Uk- We now have the tools in 
place for establishing the following quasi-optimal a priori error estimate for Galerkin 
approximations. 
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Proposition 7.10 (Quasi-Optimal A Priori Error Estimates). Let u and be exact solu- 
tions to (17.141 ) and (17.171) . respectively. If both u, Uk G then we have 

|M-Mfc|i,n< min \u - Xk\i,Q.- 
Proof. Note that the error u — Uk satisfies that 

h{u - Uk, Vk) + {N{u) - N{uk),Vk) = 0, \/vk G Xk. 
Therefore, we have 





a[u 


-Uk,U- 


Uk) 








a{u 


-Uk,U- 


Vk) + a\ 


[u - Uk, 


Vk - Uk) 




\u — 


Uk\i,n\u 


- Vk\i,n 


- {N{u^ 


) -N{uk),Vk - Uk) 




\u — 


Uk\i,n\u 


- Vk\i,n 




ul, U-Uk) + (U^ -ul,U- Vk) 


< 


\u — 




- Vk\i,n- 







Here, we noted that {N{u) - N{uk),u - Uk) ^ by (17.151) and 

{N{u) ~ N{uk),u - Vk) < sup{6u{x) + {1 - 6)uk{x))'^\\u - Uk\\\\u - Vk\\ 

< \u - Uk\i,n\u - Vk\i,n, 

by a priori L°° bounds for u and Uk and the Poincare inequality. Since Xk G Xk is 
arbitrary, we have \u - Uk\i,n < min^^ex, \u - Xk\i,n- □ 

Remark 7.11. We note a major difference between Proposition U . 10\ and Propositio ns. 4\ 
is that Proposition U. 10\ does not require the initial mesh to be sufficiently small; however 
we need the a priori L°° bound for Uk, which was built in Lemma \7S\ 

Using the results in Section [6^ we can now easily establish quasi-orthogonality. 

Lemma 7.12 (Quasi-Orthogonality). Let u be the solution to equation (|7.14l) . and Uk+i 

and Uk be the solutions to (17.171 ) on Tk+i and % respectively. Let Xk C Xk+i, and the 
triangulations Tk satisfy the condition (17.191) . Assume that there exist a ak+i > with 
<^k+i as k ^ OO such that 

\\u - Uk+i\\o,n ^ ak+i\\Vu - Vuk+i\\o,n, (7.20) 

Then there exists a constant C* > 0, such that for sufficiently small h, we have 

\u - Ufc+i|?^n ^ - Ufeli^j^ - \uk+i - Uk\\^, 

where A^+i = (1 - C*ak+iK)-^ > with K = 3sup^g[„_ ||x^l|oo,n- 

Proof. From the definition, a(-, ■) is a symmetric coercive bilinear form. The energy 
norm |||w||| := a{v,v) = \v\Iq is equivalent to the iJ^-norm in Hq(^1). Now we verify 
the Lipschitz continuity (16.321) for N{u) = u^. It follows by the a priori error estimates 
Lemma IT!8] and Lemma |T!9l of u and Uk+i : 

\{N{u) - N{uk+i),Vk+i)\ ^ sup Pxlooll^^ - ^ifc+i||o,n||^fc+i||o,n 

^ K\\u - Uk+i\\o,n\\vk+i\\i,n, 
where K = sup^g[„_ ||3x^||oo < oo. The conclusion follows by Theorem [641 □ 
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Now that the Quasi-Orthogonality As sumption 16 .61 is in place, recall that the residual- 
based a posteriori error indicator for equation (17.141) is given by (15.61) : 

rik{uk,Tf ■=hl\\ul - f\\\^+ ^ K\\[Vuk-n^]\\\^, 

aCdr 

with ?7fc(f , S) := (X]re5 ''")) ^ subset S C Tk- The second ingredient of the 

contraction argument, namely the Upper Bound Assumption I6.7[ is provided by Theo- 
rem [531 To apply the contraction Theorem 17.61 we only need to verify the Local Lips- 
chitz Assumption 17.11 which implies the Indicator Reduction Lemma fT2\ To this end, 
we introduce the PDE-related indicator: 

r]\B,T) := hi sup \\3xYoo,r- 

xe[u-,u+] 

For any subset S C T, let rjCD, S) := max^f^slrjCD, r)}. By the definition, it is obvious 
that r](I),T) is monotone decreasing, i.e., 

r/(D,r.) <r7(D,r) (7.21) 

for any refinement % of T. 

Lemma 7.13 (Local Lipschitz). Let T be a conforming partition. For all t ^ T and for 
any pair of discrete functions v,w E m+] fl X{T), it holds that 

\r]{v,T) - 7]{w,t)\ ^ Ai'r](D,r)\v -w\i^^^, (7.22) 

where Ai > depends only on the shape-regularity of To, and the maximal values that 
can obtain on the -interval 

Proof. By the definition of r/, we have 



aCdr 



Notice that 



\V ~W ||o,r < I sup ||3X ||oo,r | \\v - w\\o,r- 

On the other hand, we also have 

_ 1 

lha ■ [V(t; - w)]||o,a < hr ^ \\Vv - Vw||o,a;,- 

Therefore ,we get the desired estimate for r]. □ 

Combining all of the above we obtain a contraction result for the AFEM algorithm: 

Theorem 7.14 (Contraction and Convergence). Let {Tk, Vk,Uk}k^o be the sequence of 
finite element meshes, spaces, and solutions, respectively, produced by AFEM( 9, 1 ) with 
marking parameter 6 E (0,1] and bisection level £ ^ L Let Hq be sufficiently fine so that 
Lemma \7.12\ holds for {Tk, Vk, Uk}k^o- Then, there exist constants 7 > and a e (0, 1), 
depending only on 9, i, and the shape-regularity of the initial triangulation To, such that 

|w - Uk+i\ln + ivl+i ^ {\u - Uklln + ivl) ■ 
Consequently, we have the following convergence of AFEM algorithm: 

\u - Uk\ln + Wk < CqO^^, 
for some constant Cq = Co{uo, ho,9,l,To)- 

Proof. The results follow from Theorems |7]6l □ 



40 



M. HOLST, G. TSOGTGEREL, AND Y. ZHU 



7.3. The Poisson-Boltzmann Equation. The second example we consider is the non- 
linear Poisson-Boltzmann equation (PBE), which is widely used for modeling the elec- 
trostatic interactions among charges particles; it is important in many areas of science 
and engineering, including biochemistry and biophysics. The nonlinear PBE is 

-V ■ (eVn) + sinhu = /, in Q, 

u = 0, ondn, ^ ^ ^ 

where / = Xlili Qi^i^i)^ with Xj G Vt^ C Vt. Here, e = e(x) > is a strictly positive 
spatially-dependent dielectric coefficient, with the modified Debye-Huckel constant tak- 
ing the value k = in the solute (molecule) region Vtrn and then strictly positive in the 
solvent region Vtg '■= ^\ ^m- We will denote the interface between the molecular and 
solvent regions as F = d^lm- 

One of the main analysis and approximation theory difficulties with the PBE arises 
from the singular function /, which does not belong to H^^(n); this implies (17.231) does 
not have a solution in H^{il), or at least does not have a normal weak formulation 
with test functions coming form H^. To address this and other features of the PBE, Chen, 
Xu and Hoist [|20ll used a two-scale decomposition (see also [12711731 ) to split the solution 
into a self-energy corresponding to the electrostatic potential u^, and a screening potential 
due to high dielectric and mobile ions in the solution region. The singular component 
of the electrostatic potential satisfies 

N 

-V-(e^VM^) = 5^g,5(x,), (7.24) 

1=1 

which can be assembled from the Green's functions u'^ := J^iLi <li/{^m\x — Xi\). Sub- 
tracting (17.241) from (17.231) gives the equation for u: 

- V • (eVw) + sinh(u + u') = V ■ ((e - e„)Vn'). (7.25) 

In ll20l . a new solution theory, approximation theory, and convergent AFEM algorithm 
for the nonlinear PBE was established, based on this decomposition. However, it was 
discovered later numerically that this decomposition requires that the regular component 
must be solved at an extremely high accuracy. This defect is built into the decomposition 
itself due to the large scale separation between the two components of the splitting. A 
related decomposition scheme without this stability problem was studied numerically 
for finite difference schemes in ['2T|, and then analyzed carefully in [[3T1l . This 3-term 
decomposition uses the same first component in the molecular as defined in (17.241) ; 
the second component is the harmonic extension of the trace of on V (the interface 
between Vt^ and Vts) into the molecular region, with satisfying 



-Am^ = in a 
u'^ = — on r 



mi 



(7.26) 



One sets + u'^ = in Vlg, with the harmonic extension continuous across the 
interface by construction. The remaining regular component satisfies the Regularized 
PBE (RPBE): 

-V ■ (eVw) + sinh(M) = in 

du 



[u\r = and 



driY 



J r 



onr , (7.27) 



where 



9r = gv-=£ 



u\dn = g ondil. 



m 



drir 
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Apart from the techniques required to handle the singular features described above, the 
remaining complexities (the discontinuous dielectric and Debye-Huckel constants and 
super-critical nonlinearity) can be handled directly by the framework described in this 
paper; in particular, both forms of the regularized problem (|7.25l) and (17.271) . analyzed 
in [f20| and in [[3T| respectively, fit into the class of semilinear problems described in 
Section 17.21 The results remaining to be established for use of the AFEM contraction 
framework essentially all follow from Lipschitz control of the nonlinearity (16.321) : this 
control is gained through establishing continuous and discrete a priori L°° estimates for 
the weak solution u to (17.251 ) and (17.271 ). and for the Galerkin approximation Uk of these 
solutions. Such a priori L°° estimates are established in analyzed in [|20l and [|3T1l . fol- 
lowing cutoff-function arguments similar to those used in Lemma fhS\ above. Both the 
quasi-orthogonality result in Theorem 16.41 and the nonlinear local Lipschitz Assump- 
tion |7]T] follow from the these priori L°° bounds; for details see [31 1. Contraction (hence 
convergence) of AFEM then follows by the contraction Theorem 17.61 see [|3T]| for the 
complete argument. For a short derivation of the equation, and a more detailed discus- 
sion of the solution theory, the approximation theory, and adaptive methods, see EOllSTI . 

7.4. The Hamiltonian Constraint Equation. The third example we consider is the 
scalar Hamiltonian constraint equation, which together with the vector momentum con- 
straint, appears as the coupled Einstein constraint equations which arise in general rel- 
ativity. The derivation of the constraint equations is based on a conformal decomposi- 
tion technique, introduced by Lichnerowicz and York [|4T1 iTOl ITTll . In certain physical 
situations (constant mean extrinsic curvature of the 3-manifold spatial domain), the con- 
straints decouple so that the (linear) momentum constraint can be solved first for a vector 
potential w, leaving the Hamiltonian constraint to be solved separately for a scalar con- 
formal factor u. Let i7 C M'^ be bounded and polyhedral, with d ^ 2. We consider then 
AFEM algorithms the scalar Hamiltonian constraint equation: Find u such that 

-V • (AVu) + N{u) = inn, 

n-{AVu) + Glu) = on dN^l, (7.28) 
u = on dD^t. 

The boundary conditions of primary interest in both mathematical and numerical rela- 
tivity include the cases dD^t = or = 0, which covers various combinations of 
boundary conditions considered in the literature [172114411231 for the constraint equations. 
The tensor A is a Riemannian metric, so it appears here as a uniformly positive definite 
symmetric matrix function on ^7: 

ciier ^ M^Mj ^ csier, a.e. in Q, (7.29) 

with component functions Aij E L°°. The principle part of equation ([7.28[) is the 
Laplace-Beltrami operator with certain Riemannian metric hab- The (nonlinear) bound- 
ary function G is assumed to be C^(9Arfi). The nonlinear function A^(-) in the Hamilton- 
ian constraint equation reads: 

N{(f)) = aR(j) + a^0^ - apcj)"^ - a^0"'^, 

where Ot-, Op, G H^^iVt) are nonnegative functions, and aR := |-R G H^^iVt), with 
the scalar curvature R of the metric hab- Here 

G Hl{VL) n L°°(fi) with < M_ ^ M+ < oo. 

The construction of the subsolution m and the supersolution for the constraint equa- 
tions was discussed in detail in [i33i[ . 
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Note that N{u) is well-defined only on essentially bounded subsets of : 

: u+] C L'^in) H^\n). (7.30) 
Such a restriction will give rise to a Lipschitz property of on this set: 

yu,ve[u_,u+]nL\n), (7.31) 

which is a key tool for controlling the nonlinearity in the solution theory, when combined 
with a priori L°° bounds to establish the interval as we saw in Section 17121 

A weak formulation of equation (|7.28l) is then: Find u E u^] fl Hjj^^l) such that 

a{u,v) + {f{u),v)=0, \/veH},{n), (7.32) 

where 

a{u,v) = / AVu-Vvdx, 
Jq 

{f{u),v) = / N{u)vdx+ / G{u)vds. 
Jq JdM^ 

Thanks to the control of the nonlinearity provided by the a priori L°° bounds we es- 
tablished on any solution u to the Hamiltonian constraint, it was showed in [|33l that 
equation (|7.32l) is a well-posed problem. In particular, there exists a solution u G 

[u^,u+]nHhm. 

The remaining ideas in design the AFEM algorithm and its convergence analysis are 
the same as before. Namely, we develop the a priori L°° bounds of u and the finite 
element approximation Uk- Based on these a priori bounds, we then establish quasi- 
orthogonality and the local Lipschitz property. Finally, contraction and convergence of 
the AFEM algorithm follows by the contraction Theorem 17.61 For a detailed discussion 
of the equation, the solution theory, approximation theory, and convergence analysis of 
AFEM, see [|35l 1331. 

8. Conclusion 

In this article we developed convergence theory for a general class of adaptive approx- 
imation algorithms for abstract nonlinear operator equations on Banach spaces, and then 
used the theory to obtain convergence results for practical adaptive finite element meth- 
ods (AFEM) applied to a several classes of nonlinear elliptic equations. In the first part 
of the paper, we developed a weak-* convergence framework for nonlinear operators, 
whose Gateaux derivatives are locally Lipschitz and satisfy a local inf-sup condition. 
The framework can be viewed as extending the recent convergence results for linear 
problems of Morin, Siebert and Veeser to a general nonlinear setting. We formulated 
an abstract adaptive approximation algorithm for nonlinear operator equations in Banach 
spaces with local structure. The weak-* convergence framework was then applied to this 
class of abstract locally adaptive algorithms, giving a general convergence result. The 
convergence result was then applied to a standard AFEM algorithm in the case of sev- 
eral semilinear and quasi-linear scalar elliptic equations and elliptic systems, including 
a semilinear problem with polynomial nonlinearity, the steady Navier-Stokes equations, 
and a more general quasilinear problem. This yielded several new AFEM convergence 
results for these nonlinear problems. 

In the second part of the paper, we developed a second abstract convergence frame- 
work based on strong contraction, extending the recent contraction results for linear 
problems of Gascon, Kreuzer, Nochetto, and Siebert and of Mekchay and Nochetto to 
abstract nonlinear problems. We then established conditions under which it is possible to 
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apply the contraction framework to the abstract adaptive algorithm defined earUer, giv- 
ing a contraction result for AFEM-type algorithms applied to nonlinear problems. The 
contraction result was then applied to a standard AFEM algorithm in the case of sev- 
eral semilinear scalar elliptic equations, including a semilinear problem with polynomial 
nonlinearity, the Poisson-Boltzmann equation, and the Hamiltonian constraint in general 
relativity, yielding AFEM contraction results in each case. 
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